Abstract. Given a free group $F_k$ of rank $k \ge 2$ with a fixed set of free
generators we associate to any homomorphism $\phi$ from $F_k$ to a group $G$ with
a left-invariant semi-norm a generic stretching factor, $\lambda(\phi)$, which is a
noncommutative generalization of the translation number. We concentrate on the
situation when $\phi : F_k \to \mathrm{Aut}(X)$ corresponds to a free action of $F_k$ on a
simplicial tree $X$, in particular, when $\phi$ corresponds to the action of $F_k$ on its
Cayley graph via an automorphism of $F_k$. In this case we are able to obtain
some detailed "arithmetic" information about the possible values of $\lambda = \lambda(\phi)$.
We show that $\lambda \ge 1$ and is a rational number with $2k\lambda \in \mathbb{Z}[\frac{1}{2k-1}]$ for every
$\phi \in \mathrm{Aut}(F_k)$. We also prove that the set of all $\lambda(\phi)$, where $\phi$ varies over
$\mathrm{Aut}(F_k)$, has a gap between $1$ and $1 + \frac{2k-3}{2k^2-k}$, and the value $1$ is attained only
for "trivial" reasons. Furthermore, there is an algorithm which, when given $\phi$,
calculates $\lambda(\phi)$.



Contents 

1.1. Random subgroup distortion and growth of random automorphisms
1.2. Free actions on trees: Two interpretations of stretching factors
1.3. Main results about actions on trees
1.4. Random elements in regular languages
2. Random words and random walks
3. Frequencies and cyclic words
5. Genericity
6. Whitehead's Peak Reduction and rigidity of free group automorphisms
7. Application to the geometry of automorphism
8. Random elements in regular languages
9. Open Problems
References



2000 Mathematics Subject Classification. Primary 20F65, Secondary 37A, 37E, 57M. 
The second and the third author were supported by the NSF grant DMS#0404991 and the 
NSA grant DMA#H98230-04-1-0115.




V. KAIMANOVICH, I. KAPOVICH, AND P. SCHUPP 



1. Introduction 

1.1. Random subgroup distortion and growth of random automorphisms. 

Let $G$ be a finitely generated group with a word metric $d_S$ determined by a finite
generating set $S$ and write $|g|_S := d_S(1, g)$ for $g \in G$. Recall that if $H = \langle T \rangle$
is a subgroup of $G$ generated by a finite set $T$, then a function $f$ is said to be a
distortion function of $H$ in $G$ if for every $h \in H$ we have $|h|_T \le f(|h|_S)$. The
subgroup $H$ is quasi-isometrically embedded in $G$ if and only if for some (and hence
for all) choices of $S$, $T$ there is a linear distortion function for $H$ in $G$, that is, if the
ratio $|h|_T / |h|_S$ is bounded on $H \setminus \{1\}$.

The translation number of an element $g \in G$ is defined as

\[ \lambda(g) = \lambda_S(g) = \lim_{n \to \infty} \frac{|g^n|_S}{n}, \]

and the limit exists by the subadditivity of the sequence $|g^n|_S$. If $g$ has infinite
order, then the cyclic subgroup $\langle g \rangle$ is quasi-isometrically embedded in $G$ if and
only if $\lambda_S(g) > 0$ for some (and hence for any) finite generating set $S$ of $G$.

The study of "random" or "generic" behavior is currently an increasingly active
area of investigation in many different group-theoretic contexts. (See, for example,
[?].) In this paper we concentrate on algebraic and geometric consequences of
subadditivity, specifically of Kingman's Subadditive Ergodic Theorem.

We investigate a "noncommutative analogue" of translation number which is
defined for a "mapped-in" subgroup $H = \phi(F)$, where $\phi : F \to G$ is a homomorphism
of a free nonabelian group $F = F(A)$ of finite rank into a group $G$ with generating
set $S$. Namely, there is a number $\lambda = \lambda(\phi, A, S) \ge 0$ such that for long "random"
freely reduced words $w \in F(A)$ we have $|\phi(w)|_S / |w|_A \approx \lambda$. (Instead of the word metric
$d_S$ one could actually take an arbitrary semi-norm on $G$.)

Throughout this paper we fix the notation that $F = F(A)$ is the free group with
basis $A = \{a_1, \dots, a_k\}$ where $k \ge 2$. For any $w \in F$ let $|w|$ denote the length of the
unique freely reduced word over $A^{\pm 1}$ representing $w$. We identify the hyperbolic
boundary $\partial F$ with the set of all geodesic rays from $1 \in F$ in the Cayley graph
$\Gamma(F, A)$ of $F$, that is, $\partial F$ is the set of all semi-infinite freely reduced words over
$A^{\pm 1}$ endowed with the standard topology. The space $\partial F$ can be identified with the
space of ends or the hyperbolic boundary of $F$.

The Borel $\sigma$-algebra $\mathcal{F}$ on $\partial F$ is generated by the cylinder sets $\mathrm{Cyl}_A(v)$, $v \in F$,
where $\mathrm{Cyl}_A(v)$ consists of all infinite rays $\omega \in \partial F$ that begin with $v$. The uniform
Borel probability measure $\mu_A$ on $\partial F$ corresponding to $A$ is defined by assigning
equal weights to all cylinders based on the words of the same length. That is,

\[ \mu_A(\mathrm{Cyl}_A(v)) = \frac{1}{2k(2k-1)^{|v|-1}} \quad \text{for all } v \in F \setminus \{1\}. \]
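As a quick consistency check (a worked computation added for illustration, not part of the original argument): there are $2k(2k-1)^{n-1}$ freely reduced words of length $n \ge 1$, so giving each length-$n$ cylinder the weight $1/(2k(2k-1)^{n-1})$ yields total mass one, and each cylinder splits evenly among its $2k-1$ one-letter extensions.

```latex
\[
\sum_{|v| = n} \mu_A\bigl(\mathrm{Cyl}_A(v)\bigr)
  = 2k(2k-1)^{n-1} \cdot \frac{1}{2k(2k-1)^{n-1}} = 1,
\qquad
\mu_A\bigl(\mathrm{Cyl}_A(va)\bigr)
  = \frac{1}{2k-1}\,\mu_A\bigl(\mathrm{Cyl}_A(v)\bigr)
\]
% Here |v| = n >= 1 and a ranges over the 2k-1 letters of A^{\pm 1}
% for which the word va is still freely reduced.
```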

Note that although the boundary $\partial F$ could be defined without referring to a
particular generating set $A$, the uniformity of the measure $\mu_A$ does depend on the
choice of $A$. The fact that the uniform measures corresponding to two different free
generating sets may well be singular with respect to each other is actually the
cornerstone of our approach. (See [20] for a detailed discussion of this phenomenon in the
general context of word-hyperbolic groups and of the Patterson-Sullivan measures
corresponding to geometric actions of such groups on Gromov-hyperbolic spaces.)

A ray $\omega \in \partial F$ can be thought of as a non-backtracking edge-path in $\Gamma(F, A)$
starting from the identity $1$. We denote by $\omega_n$ the vertex on $\omega$ at distance $n$ from
$1$. The measure space $(\partial F, \mu_A)$ then becomes the space of sample paths of the
non-backtracking simple random walk (NBSRW) on the Cayley graph of $F$. This is the
Markov chain on $F$ whose transition probabilities $\pi_w$, $w \in F$, are equidistributed
among the neighbors of $w$ which are strictly further from the group identity. By
choosing a random $\mu_A$-distributed point $\omega \in \partial F$ we may think about $\omega_n$ as a
"random" (with respect to the NBSRW) freely reduced word of length $n$ in $F$.
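The non-backtracking walk described above can be sketched in a few lines of Python. This is only an illustrative sampler, not code from the paper: the function names and the encoding of letters as signed integers (`+i` for $a_i$, `-i` for $a_i^{-1}$) are assumptions made here.

```python
import random

def nbsrw_word(k, n, rng):
    """Sample omega_n: a uniformly random freely reduced word of length n.

    Letters are nonzero integers: +i stands for a_i and -i for a_i^{-1}
    (an encoding chosen for this sketch).
    """
    letters = [s * i for i in range(1, k + 1) for s in (1, -1)]
    word = []
    for _ in range(n):
        if word:
            # Non-backtracking step: any letter except the inverse of the last.
            choices = [x for x in letters if x != -word[-1]]
        else:
            # First step: all 2k letters are allowed.
            choices = letters
        word.append(rng.choice(choices))
    return word

def is_freely_reduced(word):
    """True if no letter is immediately followed by its inverse."""
    return all(x != -y for x, y in zip(word, word[1:]))

rng = random.Random(0)
w = nbsrw_word(k=2, n=12, rng=rng)
print(w, is_freely_reduced(w))
```

Each step chooses uniformly among the $2k-1$ admissible letters (all $2k$ at the first step), so every freely reduced word of length $n$ has probability $1/(2k(2k-1)^{n-1})$, matching the cylinder weights of the uniform measure.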

In Section 2 we prove: 

Theorem A. Let $F = F(a_1, \dots, a_k)$ with $k > 1$, and let $\mu_A$ be the uniform Borel
probability measure on $\partial F$ corresponding to the basis $A = \{a_1, \dots, a_k\}$.

Let $\phi : F \to G$ be a homomorphism to a group $G$ endowed with a semi-norm,
that is, a nonnegative function $|\cdot|_G$ on $G$ satisfying $|gh|_G \le |g|_G + |h|_G$ for all
$g, h \in G$.

Then:

(1) There exists a real number $\lambda \ge 0$ such that

\[ \lim_{n \to \infty} \frac{|\phi(\omega_n)|_G}{n} = \lambda \]

$\mu_A$-a.e. and in $L^1(\partial F, \mu_A)$.

(2) Suppose further that the image group $\phi(F)$ is non-amenable, and that the
sequence $b_n = \#\{g \in \phi(F) : |g|_G \le n\}$ grows at most exponentially. Then
$\lambda > 0$.

Note that the requirement of at most exponential growth of the $b_n$ is automatically
satisfied if the group $G$ is finitely generated, and $|\cdot|_G$ is the word metric on
$G$ determined by a finite generating set.

Theorem A says that for a long "random" freely reduced element $w \in F$ we have
$|\phi(w)|_G / |w| \approx \lambda$. For this reason we shall call the number $\lambda = \lambda(\phi, A, |\cdot|_G)$, whose
existence is provided by part (1) of Theorem A, the generic stretching factor of $\phi$
with respect to the free basis $A$ of $F$ and the semi-norm $|\cdot|_G$.

We deduce Theorem A from the fact that the sample paths of the usual simple
random walk on the group $F$ asymptotically follow geodesics and the well-known
results on the linear rate of escape of random walks on groups [37], [?]. We also give
an alternative direct proof of part (1) of Theorem A applying Kingman's
Subadditive Ergodic Theorem. Part (2) of Theorem A can also be proved using
the results of Cohen [?], Grigorchuk [22] and Woess [?] on co-growth in groups.

Example 1.1 (Stretching factors for isometric actions). A typical example of a
semi-norm $|\cdot|_G$ comes from isometric group actions on metric spaces. Namely,
let $X$ be a metric space with basepoint $x \in X$. For an isometry $g$ of $X$ define
$|g|_x := d(x, gx)$. The triangle inequality implies that $|g_1 g_2|_x \le |g_1|_x + |g_2|_x$, so
that $|\cdot|_x$ is a semi-norm on $G = \mathrm{Isom}(X)$. Suppose $F = F(a_1, \dots, a_k)$ acts by
isometries on $X$ by a homomorphism $\phi : F \to G$. It is easy to see that in this case
$\lambda(\phi, A, |\cdot|_x)$ does not depend on the choice of a base-point $x \in X$ and is determined
by the action $\phi$ and the choice of the basis $A$ of $F$. In this case we shall denote
$\lambda(\phi, A, |\cdot|_x)$ by $\lambda(\phi, A)$, or just by $\lambda(\phi)$ if the choice of $A$ is fixed.

Example 1.2 (Random Subgroup Distortion). Let $H \le G$ be finitely generated
groups with finite generating sets $A \subset H$ and $S \subset G$, respectively. Denote the
associated length functions by $|\cdot|_H$ and $|\cdot|_G$. Now $H$ is a quotient of $F = F(A)$.
Let $\phi : F(A) \to G$ be the composition of this quotient map with the inclusion of $H$
into $G$. Then $|\phi(w)|_H \le |w|$ for any $w \in F$. If the group $H$ is non-amenable then
by Theorem A for a long "random" freely reduced word $w$ in $F(A)$

\[ \frac{|\phi(w)|_H}{|w|} \approx \lambda_1 > 0, \quad \text{and} \quad \frac{|\phi(w)|_G}{|w|} \approx \lambda_2 > 0, \]

and therefore

\[ \frac{|\phi(w)|_H}{|\phi(w)|_G} \approx \frac{\lambda_1}{\lambda_2}, \]

where the constants $\lambda_1, \lambda_2 > 0$ do not depend on $w$. Thus, informally speaking,
Theorem A implies that any nonamenable finitely generated subgroup $H$ of a finitely
generated group $G$ has "generic-case" linear distortion in $G$.

Example 1.3 (Normal Forms). Let $G$ be a nonamenable group with a finite
generating set $A$ and the associated length function $|\cdot|_G$. We will denote by $\bar{w}$ the
element of $G$ represented by a word $w$ in the alphabet $A \cup A^{-1}$.

Let $L \subset (A \cup A^{-1})^*$ be a set of normal forms (not necessarily unique) for
elements of $G$, that is, $\bar{L} = G$. Consider, for instance, the case where $L$ is an
automatic language for $G$. By Theorem A there is $\lambda > 0$ such that for a random
long freely reduced word $w \in F(A)$ we have $|\bar{w}|_G / |w| \approx \lambda$. Let $w_L \in L$ be a word
representing the same element of $G$ as $w$. Then

\[ |w_L| \ge |\bar{w}|_G \]

and hence

\[ \frac{|w_L|}{|w|} \ge \frac{|\bar{w}|_G}{|w|} \approx \lambda > 0. \]

Thus for a long random word $w \in F(A)$, when we take $w$ to a normal form
$w_L \in L$ the ratio $|w_L| / |w|$ is separated from zero. This conclusion applies to a number
of experimental observations, such as those obtained by Dehornoy [17] in the case
of braid groups.

Theorem A has implications regarding the growth of random automorphisms.
Let $G$ be a finitely generated group with a fixed word metric corresponding to a
finite generating set $S$. Let $\phi \in \mathrm{Aut}(G)$ be an automorphism. We define the norm
of $\phi$ with respect to $S$ as

\[ \|\phi\| = \|\phi\|_S := \max_{s \in S} |\phi(s)|_S. \]

Then for any $g \in G$ we have $|\phi(g)|_S \le \|\phi\|_S |g|_S$ and hence

\[ \|\phi\|_S = \sup_{g \in G,\, g \ne 1} \frac{|\phi(g)|_S}{|g|_S}. \]

For an individual $\phi$ the sequence $\log \|\phi^n\|$ is subadditive and therefore the
following limit (sometimes called the growth entropy of $\phi$) exists:

\[ \nu(\phi) := \lim_{n \to \infty} \frac{\log \|\phi^n\|}{n}. \]
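To make the norm $\|\phi^n\|_S = \max_{s \in S} |\phi^n(s)|_S$ and the growth entropy $\lim_n \log\|\phi^n\|/n$ concrete, here is a small Python sketch for automorphisms of the free group $F(a, b)$ with $S = \{a, b\}$. The helper names, the integer encoding of letters, and the two sample automorphisms are illustrative choices made here, not taken from the paper.

```python
import math

def free_reduce(word):
    """Cancel adjacent inverse pairs; letters are +i (a_i) and -i (a_i^{-1})."""
    out = []
    for x in word:
        if out and out[-1] == -x:
            out.pop()
        else:
            out.append(x)
    return out

def apply_hom(images, word):
    """Apply the homomorphism sending generator i to images[i], then reduce."""
    result = []
    for x in word:
        img = images[abs(x)]
        result.extend(img if x > 0 else [-y for y in reversed(img)])
    return free_reduce(result)

def norms_of_powers(images, n_max):
    """Return [||phi^1||, ..., ||phi^n_max||] with respect to the basis S."""
    current = {i: [i] for i in images}  # images of the generators under phi^0
    norms = []
    for _ in range(n_max):
        current = {i: apply_hom(images, current[i]) for i in images}
        norms.append(max(len(w) for w in current.values()))
    return norms

# Two sample automorphisms of F(a, b), with a = 1 and b = 2:
fibonacci = {1: [1, 2], 2: [1]}   # a -> ab, b -> a
nielsen = {1: [1, 2], 2: [2]}     # a -> ab, b -> b

for name, phi in (("fibonacci", fibonacci), ("nielsen", nielsen)):
    norms = norms_of_powers(phi, 20)
    print(name, norms[:5], math.log(norms[-1]) / len(norms))
```

For the first map the norms are Fibonacci numbers, so $\log\|\phi^n\|/n$ tends to the logarithm of the golden ratio; for the Nielsen map $\|\phi^n\| = n + 1$ and the limit is $0$.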

It turns out that this notion has a generalization for an arbitrary finitely
generated subgroup of $\mathrm{Aut}(G)$:

Theorem B. Let $G$ be a nontrivial finitely generated group with a word-metric $d_S$
corresponding to a finite generating set $S$. Let $H \le \mathrm{Aut}(G)$ be a noncyclic subgroup
with a finite generating set $T$. Then:

(1) There is $\nu = \nu(H) = \nu(H, T, S) \ge 0$ such that for a non-backtracking simple
random walk $\phi_n$ on the Cayley graph of $H$ with respect to $T$ we have

\[ \lim_{n \to \infty} \frac{\log \|\phi_n\|_S}{n} = \nu \]

almost surely and in $L^1$.

(2) If $G$ has polynomial growth and $H$ is non-amenable then $\nu(H, T, S) > 0$.

Note that $\nu(H, T, S) > 0$ means that $\|\phi_n\|_S$ grows exponentially with $n$ for a
"random" automorphism $\phi_n$.

Corollary C. Let $F$ be a free group of finite rank $k \ge 2$ and let $H \le \mathrm{Aut}(F)$ be
a finitely generated group of automorphisms of $F$ such that the image $H'$ of $H$ in
$\mathrm{Aut}(F_{ab}) = GL(k, \mathbb{Z})$ is non-amenable. Then for any finite generating set $S$ of $F$
and for any finite generating set $T$ of $H$ we have $\nu(H, T, S) > 0$.

By the Tits alternative a subgroup of $GL(k, \mathbb{Z})$ is either virtually solvable (and
hence amenable) or it contains a free subgroup of rank two (and hence is
nonamenable). Thus in the above corollary we could replace the assumption that $H'$ is
nonamenable by the requirement that $H'$ is not virtually solvable.

1.2. Free actions on trees: Two interpretations of stretching factors. In
the context of free and discrete isometric actions of free groups on $\mathbb{R}$-trees (cf.
Example 1.1), generic stretching factors are related to Bonahon's notion [?]
of the intersection number between geodesic currents on hyperbolic surfaces. If
$G$ is a non-elementary word-hyperbolic group, a geodesic current on $G$ is a
$G$-invariant positive Borel measure on $\partial^2 G := \{(x, y) \mid x, y \in \partial G, x \ne y\}$. The space
of all geodesic currents on $G$, endowed with the weak-*-topology, is denoted by
$\mathrm{Curr}(G)$. (See [?] for a detailed discussion on the subject.)

Every nontrivial conjugacy class $[g]$ in $G$ defines an associated "counting" current
$\eta_{[g]}$ on $G$. When $S$ is a closed surface of negative Euler characteristic and $G =
\pi_1(S)$, Bonahon proved that the notion of geometric intersection number between
free homotopy classes of essential closed curves on $S$ (that is, between nontrivial
conjugacy classes of $G$) extends to a bilinear continuous "intersection form"

\[ i : \mathrm{Curr}(G) \times \mathrm{Curr}(G) \to \mathbb{R}. \]

Note that in this case $\partial G = \partial \mathbb{H}^2 = S^1$. For every hyperbolic structure $\rho$ on $S$ there
is an associated Liouville current $L_\rho \in \mathrm{Curr}(G)$ (see [?]). Bonahon's construction
has the following natural property: if $\rho$ is as above and $[g]$ is a nontrivial conjugacy
class in $G$ then $i(L_\rho, \eta_{[g]}) = \ell_\rho(g)$. Here $\ell_\rho : G \to \mathbb{R}$ is the length spectrum of $\rho$.

Thus $\ell_\rho(g)$ is equal to the translation length of $g$ as an isometry of $\mathbb{H}^2$, the
universal cover of $(S, \rho)$, and
it is also equal to the $\rho$-length of the shortest curve of the free homotopy class of
closed curves on $S$ corresponding to $[g]$. It turns out that the intersection number
$i(L_\rho, L_{\rho'})$ between Liouville currents corresponding to two hyperbolic structures
$\rho$, $\rho'$ can be interpreted as the generic stretching factor of a long random closed
geodesic on $(S, \rho)$ with respect to $\rho'$. Namely, let $p \in S$ and let $v$ be a random unit
tangent vector at $p$ on $(S, \rho)$. For every $n \ge 1$ let $\alpha_n$ be the geodesic of length $n$ on
$(S, \rho)$ with origin $p$ and with the tangent vector $v$ at $p$. Let $\beta_n$ be a geodesic from
the terminus of $\alpha_n$ to $p$ of length $\le \mathrm{Diam}(S, \rho)$. Then $\gamma_n = \alpha_n \beta_n$ is a closed curve
on $S$. Bonahon's results imply that

\[ \lim_{n \to \infty} \frac{\ell_{\rho'}([\gamma_n])}{\ell_\rho([\gamma_n])} = \lim_{n \to \infty} \frac{\ell_{\rho'}([\gamma_n])}{n} = \frac{i(L_\rho, L_{\rho'})}{\pi^2 |\chi(S)|}. \]

It turns out that a version of this interpretation applies in the context of free
groups acting on trees. Let $F$ be a free group of finite rank $k \ge 2$. In [?], [31]
Kapovich investigated a natural "intersection form" $I : \mathrm{FLen}(F) \times \mathrm{Curr}(F) \to \mathbb{R}$,
where $\mathrm{FLen}(F)$ is the space of hyperbolic length functions corresponding to free
and discrete isometric actions of $F$ on $\mathbb{R}$-trees. This form still has the natural
property that for any nontrivial conjugacy class $[g]$ in $F$ and any $\ell \in \mathrm{FLen}(F)$ we
have $I(\ell, \eta_{[g]}) = \ell(g)$. Let $A$ be a free basis of $F$ and let $\ell \in \mathrm{FLen}(F)$ be realized
by a free and discrete isometric action $\phi : F \to \mathrm{Isom}(X)$ of $F$ on an $\mathbb{R}$-tree $X$.
Let $\mu_A$ be the uniform measure on $\partial F$ corresponding to $A$. The measure $\mu_A$ on
$\partial F$ determines a uniform current $\nu_A \in \mathrm{Curr}(F)$ that is analogous to the Liouville
current corresponding to a hyperbolic structure on a surface. As shown in [31],
similarly to Bonahon's situation, we have

\[ I(\ell, \nu_A) = \lambda_A(\phi). \]

Generic stretching factors are also related to the notion of the Hausdorff
dimension of a measure with respect to a metric. If $\mu$ is a measure on a metric space
$(M, d)$, the Hausdorff dimension of $\mu$ with respect to $d$, denoted $\mathrm{HD}_d(\mu)$ (or just
$\mathrm{HD}(\mu)$), is defined as the infimum of Hausdorff dimensions of subsets of $(M, d)$ of
full measure $\mu$.

In [28] Kaimanovich proved that for the harmonic measure $\nu$ on $\partial T$ associated
to a regular Markov operator $P$ with a positive rate of escape on a tree $T$ with
uniformly bounded vertex degrees we have

\[ \mathrm{HD}(\nu) = \frac{h}{c}, \]

where $c$ is the rate of escape and $h$ is the asymptotic entropy of $P$.

This result is relevant in our context. Indeed, let $A$ be a free basis of $F$ and
let $\phi : F \to \mathrm{Isom}(X)$ be a free, discrete and minimal isometric action of $F$ on an
$\mathbb{R}$-tree $X$. Then $X/F$ is a finite metric graph and $X$ is the universal cover of this
graph. Let $\Gamma(F, A)$ denote the Cayley graph of $F$ with respect to $A$. The orbit map
$w \mapsto wp$ (where $p \in X$ is a base-point) gives a quasi-isometry between the trees
$\Gamma(F, A)$ and $X$ which extends to a homeomorphism $\hat{\phi} : \partial\Gamma(F, A) \to \partial X$, where $\partial X$
is metrized in the standard CAT(-1) way: $d(\xi, \zeta) = e^{-(\xi|\zeta)_p}$ for $\xi, \zeta \in \partial X$. Let
$\mu_A$ be the uniform probability measure on $\partial\Gamma(F, A) = \partial F$ corresponding to $A$ and
let $\mu'_A$ denote the push-forward of $\mu_A$ via $\hat{\phi}$ to $\partial X$.

Then the result of Kaimanovich [28] mentioned above implies that

\[ \mathrm{HD}_d(\mu'_A) = \frac{\log(2k - 1)}{\lambda_A(\phi)}, \]

where $k \ge 2$ is the rank of $F$.



1.3. Main results about actions on trees.

Our first main result is:

Theorem D. Let $F = F(A)$ be a free group of rank $k \ge 2$. Let $\phi : F \to \mathrm{Aut}(X)$
be a free simplicial action without inversion of $F$ on a simplicial tree $X$.
Then the following hold:

(1) The generic stretching factor $\lambda = \lambda(\phi)$ is a rational number $\ge 1$ with

\[ 2k\lambda \in \mathbb{Z}\left[\frac{1}{2k-1}\right]. \]

(2) The number $\lambda(\phi)$ is algorithmically computable in terms of $\phi$, provided $X$
is the universal cover of a finite connected graph and $\phi$ is given by an
isomorphism between $F$ and the fundamental group of that graph.

The most interesting case of the above theorem is when X is the Cayley graph 
of F = F(A) determined by an endomorphism of F. 

Definition 1.4 (Generic stretching factor of an endomorphism). Let $F = F(A)$
where $k \ge 2$ and $A = \{a_1, \dots, a_k\}$. Let $\phi : F \to F$ be an endomorphism of $F$. Let
$X = \Gamma(F, A)$ be the Cayley graph of $F$ and consider the action $\theta : F \to \mathrm{Isom}(X)$
given by $\theta(w)x := \phi(w)x$, where $w \in F$, $x \in X$. The generic stretching factor $\lambda_A(\theta)$
corresponding to this action is called the generic stretching factor of $\phi$ with respect
to $A$ and is denoted $\lambda_A(\phi)$, or just $\lambda(\phi)$ if $A$ is fixed.

Thus $\lambda(\phi)$ approximates the distortion $|\phi(w)|_A / |w|_A$ for a long random freely
reduced word $w$ in $A^{\pm 1}$. For instance, for the Nielsen automorphism $\phi \in \mathrm{Aut}(F(a, b))$,
$\phi(a) = ab$, $\phi(b) = b$, it turns out that $\lambda(\phi) = 7/6$. If $\phi$ is an automorphism of
$F(a_1, \dots, a_k)$, then the precise relationship between $\lambda(\phi)$ and the traditionally
studied dynamical properties of $\phi$ is not very clear. Nevertheless, we are able to estimate
the growth of $\lambda(\phi^n)$ for hyperbolic automorphisms. Recall that $\phi \in \mathrm{Aut}(F)$ is
hyperbolic if there exist $s > 1$ and $m \ge 1$ such that for any $w \in F$

\[ s\|w\| \le \max\{\|\phi^m(w)\|, \|\phi^{-m}(w)\|\}. \]

By a result of Brinkmann [?] an automorphism $\phi \in \mathrm{Aut}(F)$ is hyperbolic if and
only if $\phi$ does not have any nontrivial periodic conjugacy classes in $F$. We prove:
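The distortion $|\phi(w)|_A / |w|_A$ for a long random freely reduced word can be estimated numerically. The Monte Carlo sketch below does this for the Nielsen automorphism $a \mapsto ab$, $b \mapsto b$ of $F(a, b)$; the function names and the integer encoding of letters are assumptions of this illustration, not the paper's algorithm. A heuristic frequency count predicts the value: the unreduced image has average length $3/2$ per letter, and each occurrence of the subwords $ab^{-1}$ or $ba^{-1}$ (together of frequency $2/12$) cancels two letters, which suggests $3/2 - 1/3 = 7/6$.

```python
import random

def random_reduced_word(k, n, rng):
    """Uniformly random freely reduced word of length n >= 1 (+i = a_i, -i = a_i^{-1})."""
    letters = [s * i for i in range(1, k + 1) for s in (1, -1)]
    word = [rng.choice(letters)]
    while len(word) < n:
        x = rng.choice(letters)
        if x != -word[-1]:          # reject backtracking steps
            word.append(x)
    return word

def image_length(images, word):
    """Freely reduced length of phi(word), where phi(a_i) = images[i]."""
    out = []
    for x in word:
        img = images[abs(x)]
        for y in (img if x > 0 else [-z for z in reversed(img)]):
            if out and out[-1] == -y:
                out.pop()           # free cancellation
            else:
                out.append(y)
    return len(out)

def estimate_stretch(images, k, n, rng):
    """One-sample estimate of |phi(w)| / |w| for a random reduced word w of length n."""
    w = random_reduced_word(k, n, rng)
    return image_length(images, w) / n

nielsen = {1: [1, 2], 2: [2]}       # a -> ab, b -> b
est = estimate_stretch(nielsen, k=2, n=200_000, rng=random.Random(0))
print(est, 7 / 6)
```

A single long sample suffices here because the ratio converges almost surely; averaging several independent samples would only tighten the estimate.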

Theorem E. Let $F = F(a_1, \dots, a_k)$ and let $\phi \in \mathrm{Aut}(F)$ be a hyperbolic
automorphism with parameters $s > 1$ and $m \ge 1$ as above. Then

\[ \liminf_{n \to \infty} \sqrt[n]{\lambda(\phi^n)} \ge s^{1/m} > 1. \]

It is obvious that any automorphism of a finitely generated group $G$ equipped
with a word metric is a quasi-isometry and indeed a bi-Lipschitz equivalence.
However, from the geometric point of view, especially in light of various versions of the
Marked Length Spectrum Rigidity Conjecture, it is natural to study finer features of
quasi-isometries. Recall that a map $f : (X, d) \to (X', d')$ is called a rough isometry
if there is $D \ge 0$ such that for any $x, y \in X$ we have $|d'(f(x), f(y)) - d(x, y)| \le D$.
A map $f : (X, d) \to (X', d')$ is called a rough similarity if there are $\lambda > 0$ and
$D \ge 0$ such that for any $x, y \in X$ we have $|d'(f(x), f(y)) - \lambda d(x, y)| \le D$. It is
interesting and natural to ask when an automorphism is a rough similarity or a
rough isometry.

An automorphism $\phi$ of $F = F(A)$ is called a relabeling automorphism if it is
induced by a permutation of the set $A^{\pm 1} = \{a_1, \dots, a_k\}^{\pm 1}$. We say that $\phi \in \mathrm{Aut}(F)$
is simple if it is equivalent to a relabeling automorphism in $\mathrm{Out}(F)$, that is, if $\phi$ is
the composition of a relabeling automorphism and a conjugation. Note that being
a simple automorphism has a nice geometric meaning. Let $F = F(a_1, \dots, a_k)$ be
realized as the fundamental group of the metric graph $\Gamma$ which is a bouquet of $k$
circles of length $1$ corresponding to the generators $a_1, \dots, a_k$. An automorphism $\phi$
is simple if and only if, after possibly a composition with an inner automorphism,
$\phi$ is induced by an isometry of the graph $\Gamma$.

Let $P_n$ be the uniform probability measure on the set of all elements of $F$ of length
$n$. A set $W \subset F$ is said to be exponentially $F$-generic if $\lim_{n \to \infty} P_n(W) = 1$ and
convergence to this limit is exponentially fast. Similarly, a subset $C \subset \mathcal{CR}$ of the set
$\mathcal{CR}$ of all cyclically reduced words is exponentially $\mathcal{CR}$-generic if $\lim_{n \to \infty} P'_n(C) = 1$
with exponentially fast convergence, where $P'_n$ is the uniform discrete probability
measure on the set of cyclically reduced words of length $n$.

Obviously, any simple automorphism is a rough isometry and a rough similarity.
The converse is also true, that is, any automorphism which is a rough similarity
must be simple. (This follows, for example, from Theorem 2 of [20] together with
some standard results about Culler-Vogtmann outer space.) Here we obtain a
strengthened "random rigidity" version of this fact:

Theorem F. Let $F = F(a_1, \dots, a_k)$ be a free group of rank $k \ge 2$ with the standard
word metric $d$ corresponding to the free basis $\{a_1, \dots, a_k\}$. Put $\lambda_0 := 1 + \frac{2k-3}{4k^2-2k}$.
There exists an exponentially $\mathcal{CR}$-generic set $C \subset \mathcal{CR}$ with the following property.
For any $\phi \in \mathrm{Aut}(F)$ the following conditions are equivalent:

(1) The automorphism $\phi$ is simple.

(2) We have $\lambda(\phi) = 1$.

(3) We have $\lambda(\phi) < 1 + \frac{2k-3}{2k^2-k}$.

(4) The map $\phi : (F, d) \to (F, d)$ is a rough isometry.

(5) The map $\phi : (F, d) \to (F, d)$ is a rough similarity.

(6) For some $w \in C$ we have $\|\phi(w)\| = \|w\|$.

(7) For every $w \in C$ we have $\|\phi(w)\| = \|w\|$.

(8) For some $w \in C$ we have $\|\phi(w)\| \le \lambda_0 \|w\|$.

(9) For every $w \in C$ we have $\|\phi(w)\| \le \lambda_0 \|w\|$.

This result shows, in particular, that the set of all possible values of $\lambda(\phi)$ (where
$\phi \in \mathrm{Aut}(F)$) has a gap, namely the interval $(1, 1 + \frac{2k-3}{2k^2-k})$. Moreover, in the above
theorem we can choose $\lambda_0$ to be any number such that $1 < \lambda_0 < 1 + \frac{2k-3}{2k^2-k}$.

Theorem F introduces a new dimension for rigidity results related to Marked
Length Spectra on hyperbolic groups. Indeed, it is well-known that if $\phi \in \mathrm{Aut}(F)$
fixes the lengths of all conjugacy classes (that is, of all cyclic words), then $\phi$ is a
rough isometry of $F$. Theorem F shows that even if $\phi \in \mathrm{Aut}(F)$ preserves the
length of a single "random" cyclically reduced word $w$ then $\phi$ is a rough isometry
and indeed a simple automorphism. To prove Theorem F we need some rather
different tools and ideas, both algebraic and probabilistic. The key ingredient there
is the work of Kapovich-Schupp-Shpilrain [35] about the behavior of Whitehead's
algorithm and the action of $\mathrm{Aut}(F)$ on "random" elements of $F$.

Using Theorem F it is not hard to show that the set of generic stretching factors
taken over all free actions of $F(a_1, \dots, a_k)$ on simplicial trees also has a gap. Thus
we obtain:

Theorem G. Let $F = F(a_1, \dots, a_k)$ where $k \ge 2$. Let $\phi : F \to \mathrm{Aut}(X)$ be a free
minimal action of $F$ on a simplicial tree $X$ without inversions.
Then exactly one of the following occurs:

(1) There is a simple automorphism $\alpha$ of $F$ such that $X$ is $\phi \circ \alpha$-equivariantly
isomorphic to the Cayley graph of $F$ with respect to $\{a_1, \dots, a_k\}$. In this
case $\lambda(\phi) = 1$.

(2) We have $\lambda(\phi) \ge 1 + \frac{1}{k(2k-1)}$.

For an automorphism $\phi \in \mathrm{Aut}(F)$ the conjugacy distortion spectrum of $\phi$ is

\[ I(\phi) := \left\{ \frac{\|\phi(w)\|}{\|w\|} : w \in F, w \ne 1 \right\}. \]

Kapovich proved in [?] that $I(\phi)$ is always a $\mathbb{Q}$-convex subinterval of $\mathbb{Q}$ (that is,
a set closed under taking rational convex combinations) with rational endpoints.
Here we obtain:

Corollary H. Let $\phi \in \mathrm{Aut}(F)$ be an arbitrary automorphism. Then the following
hold.

(1) Either $\phi$ is simple and $I(\phi) = \{1\}$ or, else, $1$ belongs to the interior of $I(\phi)$.

(2) There exists $w \in F$, $w \ne 1$ such that $\|\phi(w)\| = \|w\|$.

Proof. Part (1) obviously implies part (2) since, by the above mentioned result
of [?], $I(\phi)$ is a $\mathbb{Q}$-convex subset of $\mathbb{Q}$.

To see that (1) holds, assume that $\phi$ is not simple. Hence $\phi^{-1}$ is not simple either.
By Theorem F we have $\lambda(\phi) > 1$ and $\lambda(\phi^{-1}) > 1$. Part (2b) of Theorem [?] now
implies that there exist $w_1, w_2 \in F$, $w_1 \ne 1$, $w_2 \ne 1$ such that $x := \frac{\|\phi(w_1)\|}{\|w_1\|} > 1$
and $y := \frac{\|\phi^{-1}(w_2)\|}{\|w_2\|} > 1$. By definition of $I(\phi)$ we have $x \in I(\phi)$. Also, with
$u = \phi^{-1}(w_2)$ we have $y = \frac{\|u\|}{\|\phi(u)\|} > 1$ and so $1/y \in I(\phi)$. Since $x > 1$ and $1/y < 1$,
the $\mathbb{Q}$-convexity of $I(\phi)$ implies that $1$ belongs to the interior of $I(\phi)$, as claimed. □

We also obtain an application of Theorem [?] concerning the notion of the flux
of an automorphism that was introduced and studied by Myasnikov and Shpilrain
in [?].

Definition 1.5 (Flux). [?] Let $G$ be a finitely generated group with a fixed word
metric. Let $\phi \in \mathrm{Aut}(G)$.
For each $n \ge 0$ define

\[ \mathrm{flux}_\phi(n) := \#\{g \in G : |g| \le n, |\phi(g)| > n\} \]

and

\[ \mathrm{flux}(\phi) := \limsup_{n \to \infty} \frac{\mathrm{flux}_\phi(n)}{\#\{g \in G : |g| \le n\}}. \]

The sequence $\mathrm{flux}_\phi(n)$ and the number $\mathrm{flux}(\phi)$ provide a certain dynamical
"measure of activity" of an automorphism $\phi$. As a corollary of our results in this
paper we obtain:

Corollary I. Let $F = F(a_1, \dots, a_k)$ be a free group of rank $k \ge 2$, equipped with
the standard metric.

Then for any $\phi \in \mathrm{Aut}(F)$ we have:

\[ \mathrm{flux}(\phi) = \begin{cases} 0, & \text{if } \phi \text{ is a relabeling automorphism} \\ 1, & \text{otherwise.} \end{cases} \]

1.4. Random elements in regular languages. By definition, a language $L$ over
the alphabet $A$ is regular if and only if there is a deterministic finite automaton
which accepts the language $L$. It is a basic fact of formal language theory that
the class of languages accepted by nondeterministic finite automata (NDFA) is also
the class of regular languages. (See Hopcroft and Ullman [23].) Nondeterministic
automata are very useful because an NDFA accepting a language $L$ may be much
smaller than any deterministic automaton accepting $L$. Such an automaton is not
unique and choosing some finite automaton accepting $L$ is like choosing a
presentation for a group. One can choose a "random" element in the regular language $L$
via a random walk in the transition graph of any "suitable" finite state automaton
$M$ accepting the language $L$. We make this precise in Section 8, where we associate
to $M$ a finite state Markov process $M'$ with the set of states being the set of directed
edges in the transition graph $\Gamma(M)$ of $M$. The sample space $\Omega$ of $M'$ is the set of
semi-infinite edge-paths in $\Gamma(M)$. Each path in $\Gamma(M)$ (finite or infinite) has a label
that is a word (finite or infinite) over $A$. If $\omega \in \Omega$ is such an infinite path, we denote
by $w_n = w_n(\omega)$ the label of the initial segment of length $n$ of $\omega$. Any initial
probability distribution $\mu$ on the edge-set $E(\Gamma(M))$ defines a probability measure $P_\mu$ on
$\Omega$. We need to impose a natural assumption on $M$ in order to guarantee that the
Markov process $M'$ is irreducible. This technical assumption, which is frequently
satisfied in practice, is made precise in the definition of a normal automaton in
Section 8. Again applying the Subadditive Ergodic Theorem, we have:

Theorem J. Let $M$ be a normal automaton over a finite alphabet $A$ and let $L =
L(M)$ be the language accepted by $M$.

Let $\phi : A^* \to G$ be a monoid homomorphism, where $G$ is a group with a
left-invariant semi-metric $d_G$. Then there exists a number $\lambda = \lambda(M, \phi, d_G) \ge 0$ such
that for any initial distribution $\mu$ on $E(\Gamma(M))$ we have

\[ \frac{|\phi(w_n)|_G}{n} \to \lambda \quad \text{almost surely and in } L^1 \text{ with respect to } P_\mu. \]

If the initial distribution $\mu$ is supported on the edges of $\Gamma(M)$ originating at
the start states of $M$ then the word $w_n$ can be extended by a word of uniformly
bounded length to get a word $w'_n \in L$. We can think of $w'_n$ as a "random" element
of $L$ with respect to $M$ and $\mu$. Theorem J then implies that $|\phi(w'_n)|_G / n \to \lambda$ as $n \to \infty$
almost surely and in $L^1$ with respect to $P_\mu$.
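The edge-path random walk in the transition graph can be illustrated with a toy example. The sketch below is written under loud assumptions: the automaton `EDGES`, its start states, and the rule of choosing uniformly among outgoing edges are all hypothetical choices made here, since the precise Markov process and the definition of a normal automaton are given only in Section 8.

```python
import random

# Hypothetical automaton over {a, b}: edges are (source, label, target) triples.
EDGES = [(0, "a", 1), (0, "b", 0), (1, "a", 0), (1, "b", 1)]
START_STATES = {0}

def random_label_word(edges, start_states, n, rng):
    """Label w_n of a random edge-path of length n >= 1 in the transition graph.

    Assumption of this sketch: the initial edge is uniform among edges leaving
    a start state, and each next edge is uniform among edges leaving the
    current target vertex.
    """
    outgoing = {}
    for edge in edges:
        outgoing.setdefault(edge[0], []).append(edge)
    first = [e for e in edges if e[0] in start_states]
    edge = rng.choice(first)
    labels = [edge[1]]
    for _ in range(n - 1):
        edge = rng.choice(outgoing[edge[2]])
        labels.append(edge[1])
    return "".join(labels)

print(random_label_word(EDGES, START_STATES, 20, random.Random(0)))
```

Every vertex of this toy graph has outgoing edges, so the walk never gets stuck; an automaton with dead-end states would need extra handling.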

2. Random words and random walks 

Convention 2.1. Let $A = \{a_1, \dots, a_k\}$ be a free generating set of a free group
$F = F(A)$ of finite rank $k > 1$. For $w \in F$ we denote by $|w|_A$ (or simply $|w|$)
the freely reduced length of $w$ with respect to $A$. Let $d(w_1, w_2) = |w_1^{-1} w_2|$ be the
associated left-invariant metric on $F$. By $\|w\|_A = \|w\|$ we denote the cyclically
reduced length of $w$ with respect to $A$, that is, the length of any cyclically reduced
word in the alphabet $A^{\pm 1}$ conjugate to $w$.

This convention, including the fixed choice of the free basis $A = \{a_1, \dots, a_k\}$ of
$F$, is adopted for the remainder of the paper, unless specified otherwise.

Recall that a nonnegative function $|\cdot|_G$ on a group $G$ is called a semi-norm if
for all $g, h \in G$ we have $|gh|_G \le |g|_G + |h|_G$.

In this Section we shall prove Theorem A from the Introduction:

Theorem 2.2. Let $F = F(A)$, and let $\mu_A$ be the uniform Borel probability
measure on $\partial F$ corresponding to the generating set $A = \{a_1, \dots, a_k\}$ with $k \ge 2$.
Let $\phi : F \to G$ be a homomorphism to a group $G$ endowed with a semi-norm $|\cdot|_G$.
Then:

(1) There exists a real number $\lambda \ge 0$ such that

\[ \lim_{n \to \infty} \frac{|\phi(\omega_n)|_G}{n} = \lambda \]

for $\mu_A$-a.e. $\omega \in \partial F$ and in the space $L^1(\partial F, \mu_A)$.

(2) If the group $\phi(F)$ is non-amenable and the sequence

\[ b_n = \#\{g \in \phi(F) : |g|_G \le n\} \]

grows at most exponentially, then $\lambda > 0$.

The condition on $b_n$ in the above theorem is always satisfied if $G$ is a finitely
generated group and $|\cdot|_G$ is the word metric corresponding to some finite generating
set of $G$.

Any geodesic ray $\omega \in \partial F$ can be identified with the non-backtracking path
$\omega_0, \omega_1, \dots$ in $F$ starting from the group identity. Then the measure space $(\partial F, \mu_A)$
becomes the space of sample paths of the non-backtracking simple random walk
(NBSRW) on the Cayley graph of $F$ starting from the identity of the group. This
is the Markov chain on $F$ whose transition probabilities $\pi_f$, $f \in F$, are
equidistributed among the neighbors of $f$ which are strictly further from the group
identity. Therefore, the number $\lambda$ above is the linear rate of escape of the $\phi$-image
of the non-backtracking simple random walk on $F$. We shall deduce Theorem 2.2
from well-known analogous properties of the usual random walks on groups by
using the fact that the simple random walk on the free group asymptotically follows
uniformly distributed geodesic rays.

Let $\mu$ be a probability measure on a group $G$. By definition, the sample paths
of the associated random walk $(G, \mu)$ are products $g_n = h_1 h_2 \cdots h_n$ of independent
$\mu$-distributed increments $h_n$. In other words, the measure $P$ in the space of sample
paths which describes the random walk $(G, \mu)$ is the image of the product measure
$\mu \otimes \mu \otimes \cdots$ in the space of increments under the above product map.

The following statement is known as Kingman's Subadditive Ergodic
Theorem [30]. (See also [15] for a short proof.)

Proposition 2.3 (Subadditive Ergodic Theorem). Let $(\Omega, \mathcal{F}, \mu)$ be a probability
space and let $S : \Omega \to \Omega$ be a measure-preserving operator, that is, such that for any
measurable set $Q \subset \Omega$ we have $\mu(Q) = \mu(S^{-1}Q)$.

Let $X_n : \Omega \to \mathbb{R}$ be a sequence of non-negative integrable random variables such
that for any $n, m \ge 0$

\[ X_{n+m}(\omega) \le X_n(\omega) + X_m(S^n \omega), \quad \text{a.e. } \omega \in \Omega. \]

Then there exists an $S$-invariant random variable $\lambda : \Omega \to \mathbb{R}$ such that

\[ \lim_{n \to \infty} \frac{X_n(\omega)}{n} = \lambda(\omega) \]

almost surely and in $L^1$ on $\Omega$.

In particular, if $S$ is ergodic then $\lambda = \mathrm{const}$ on $\Omega$.

A straightforward application of Kingman's Subadditive Ergodic Theorem gives:

Proposition 2.4 ([26]). If the measure $\mu$ has a finite first moment $\sum_g |g| \mu(g)$ with
respect to a semi-norm $|\cdot|$ on the group $G$, then there exists a number $c \ge 0$ (called
the linear rate of escape of the random walk $(G, \mu)$ with respect to the semi-norm
$|\cdot|$) such that $|g_n|/n \to c$ for $P$-a.e. sample path $(g_n)$ and in the space $L^1(P)$.

The following claim, although slightly more general than the one formulated in [26], can
be obtained in the same way by using the spectral characterization of amenability
(or by showing that $c = 0$ implies vanishing of the asymptotic entropy of the random
walk, and therefore amenability of the group [37]):

Proposition 2.5 ([26]). Under the assumptions of Proposition 2.4, if the group $G$
is non-amenable, the semi-norm $|\cdot|_G$ has exponentially bounded growth and the
support of the measure $\mu$ generates the group $G$, then $c > 0$.

Let now $\mu'_A$ be the probability measure on the free group $F$ equidistributed on
the set $A^{\pm 1}$, so that $\mu'_A(a_i^{\pm 1}) = 1/2k$ for $i = 1, 2, \dots, k$.

Proposition 2.6 (see j2H) and the references therein). For P-a.e. sample path 
$(g_n)$ of the random walk $(F, \mu'_A)$:

(1) There exists a limit

\[ g_\infty = \lim_{n} g_n \in \partial F, \]

and its distribution (i.e., the image of the measure $P$ under the map
$(g_n) \mapsto g_\infty$) coincides with the uniform measure $\mu_A$ on $\partial F$.

(2) We have

\[ \lim_{n\to\infty} \frac{|g_n|}{n} = \theta := \frac{k-1}{k} \]

(so that the linear rate of escape of the random walk $(F, \mu'_A)$ is $\frac{k-1}{k}$).

(3) We have

\[ d\big(g_n, (g_\infty)_{[\theta n]}\big) = o(n), \]

where $[x]$ denotes the integer part of a number $x$.
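The rate of escape in part (2) can be checked numerically. The following is a minimal sketch, not taken from the source: it encodes the generator $a_i$ as the integer `i` and its inverse as `-i` (an assumed encoding), runs the simple random walk while keeping the freely reduced form on a stack, and compares $|g_n|/n$ with $(k-1)/k$.

```python
import random

def simple_random_walk_length(k, n, seed=0):
    """Return |g_n| for the simple random walk on the free group of rank k.

    Assumed encoding: the integer i stands for a_i and -i for its inverse.
    """
    rng = random.Random(seed)
    letters = [s * i for i in range(1, k + 1) for s in (1, -1)]
    reduced = []  # freely reduced form of g_n, kept as a stack
    for _ in range(n):
        h = rng.choice(letters)          # uniform increment on A^{+-1}
        if reduced and reduced[-1] == -h:
            reduced.pop()                # the new letter cancels the last one
        else:
            reduced.append(h)
    return len(reduced)

k, n = 2, 200_000
rate = simple_random_walk_length(k, n) / n
print(rate, (k - 1) / k)
```

Each step lengthens the reduced word with probability $(2k-1)/2k$ and shortens it with probability $1/2k$ (away from the identity), which is where the drift $(k-1)/k$ comes from.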

Proof of Theorem 2.2. Consider the random walk $(F, \mu'_A)$. Its image under the
homomorphism $\phi$ is the random walk on the group $\phi(F)$ governed by the measure
$\phi(\mu'_A)$. Denote its rate of escape with respect to the semi-norm $|\cdot|_G$ by $c$.
Then the combination of Proposition 2.4 and Proposition 2.6 implies the first part
of Theorem 2.2. Indeed, for $P$-a.e. sample path $(g_n)$ the distance in $F$ between
$g_n$ and $(g_\infty)_{[\theta n]}$ is sublinear. Since $|\phi(g)|_G \le C|g|_A$ with
$C = \max_i |\phi(a_i^{\pm 1})|_G$, the distance in $G$ (with respect to the semi-metric
determined by the semi-norm $|\cdot|_G$) between the $\phi$-images of these points is
also sublinear. Since the distribution of $g_\infty$ is $\mu_A$, we arrive at the
conclusion that the first part of Theorem 2.2 holds for the number $\lambda = c/\theta$.

The second part of Theorem 2.2 is now an immediate corollary of Proposition 2.5.

Here is another argument establishing part (1) of Theorem 2.2 as a direct
consequence of the Subadditive Ergodic Theorem.

Let $\Omega = \partial F$. Recall that for $\omega \in \partial F$ we denote by $\omega_n$
the element of $F$ that is at distance $n$ from $1$ along the geodesic ray $\omega$ in
$\Gamma(F, A)$. Let $X_n : \partial F \to \mathbb{R}$ be defined as
$X_n(\omega) := |\phi(\omega_n)|_G$. Also, let $S : \partial F \to \partial F$ be the
standard shift operator consisting in erasing the first letter of a semi-infinite
freely reduced word representing an element of $\partial F$. It is well-known that $S$
is stationary and ergodic.

Note that for any $\omega \in \partial F$ we have

\[ \omega_{n+m} = \omega_n \, (S^n\omega)_m. \]

Hence

\[ |\phi(\omega_{n+m})|_G = |\phi(\omega_n)\,\phi((S^n\omega)_m)|_G \le |\phi(\omega_n)|_G + |\phi((S^n\omega)_m)|_G. \]

Thus the conditions of the Subadditive Ergodic Theorem are satisfied and part (1)
of Theorem 2.2 follows. □
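The limit $|\phi(\omega_n)|_G / n$ can be estimated by simulation. The sketch below makes several assumptions that are not in the source: $G = F = F(a,b)$ with the word norm, upper-case letters stand for inverses, and $\phi$ is the example automorphism $a \mapsto ab$, $b \mapsto b$. A hand count of cancellations for this particular $\phi$ suggests the value $7/6$, which the simulation should approximate.

```python
import random

def inv(x):
    """Inverse of a letter; upper case encodes the inverse generator."""
    return x.swapcase()

def free_reduce(word):
    """Freely reduce a sequence of letters using a stack."""
    out = []
    for x in word:
        if out and out[-1] == inv(x):
            out.pop()
        else:
            out.append(x)
    return out

def random_reduced_word(n, letters="abAB", seed=0):
    """Initial segment omega_n of a uniformly random geodesic ray."""
    rng = random.Random(seed)
    word = []
    for _ in range(n):
        choices = [x for x in letters if not word or x != inv(word[-1])]
        word.append(rng.choice(choices))
    return word

# Hypothetical example automorphism of F(a, b): a -> ab, b -> b.
PHI = {"a": "ab", "b": "b"}

def apply_hom(images, word):
    """Apply the homomorphism given on generators and freely reduce."""
    out = []
    for x in word:
        if x.islower():
            out.extend(images[x])
        else:  # image of an inverse letter is the inverse of the image
            out.extend(inv(y) for y in reversed(images[x.lower()]))
    return free_reduce(out)

n = 200_000
omega_n = random_reduced_word(n)
ratio = len(apply_hom(PHI, omega_n)) / n
print(ratio)
```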

The following is Theorem B from the Introduction.

Theorem 2.7. Let $G$ be a nontrivial finitely generated group with a word-metric
$d_S$ corresponding to a finite generating set $S$. Let $H \le \mathrm{Aut}(G)$ be a
noncyclic finitely generated group with a finite generating set $T$. Then:

(1) There is $\nu = \nu(H) = \nu(H, T, S) \ge 0$ such that for a non-backtracking
simple random walk $\phi_n$ on the Cayley graph of $H$ with respect to $T$ we have

\[ \lim_{n\to\infty} \frac{\log \|\phi_n\|_S}{n} = \nu \]

almost surely and in $L^1$.

(2) If $G$ has polynomial growth and $H$ is non-amenable then $\nu(H, T, S) > 0$.

Proof. It is clear from the definition of $\|\cdot\|_S$ that for any
$\phi, \psi \in \mathrm{Aut}(G)$ we have

\[ \|\phi\psi\|_S \le \|\phi\|_S \, \|\psi\|_S \]

and hence

\[ \log \|\phi\psi\|_S \le \log \|\phi\|_S + \log \|\psi\|_S. \]

Also, for any $\phi \in \mathrm{Aut}(G)$ we have $\|\phi\|_S \ge 1$ and so
$\log \|\phi\|_S \ge 0$. Thus $\log \|\cdot\|_S$ is a semi-norm on $\mathrm{Aut}(G)$ that
uniquely extends to a left-invariant semi-metric on $\mathrm{Aut}(G)$ and thus on
$H \le \mathrm{Aut}(G)$. Hence part (1) of Theorem 2.7 follows directly from part (1)
of Theorem A.

To see that part (2) holds suppose that $H$ is non-amenable and that $G$ has
polynomial growth. This implies that $(H, \|\cdot\|_S)$ has at most exponential growth.
Hence part (2) of Theorem 2.7 follows from part (2) of Theorem A. □

Remark 2.8. The requirement of $G$ having polynomial growth in Theorem B is
important and cannot be easily dispensed with. If $G$ is a group and $g \in G$, denote
by $\mathrm{ad}(g) \in \mathrm{Aut}(G)$ the inner automorphism of $G$ defined as
$\mathrm{ad}(g)(x) = g x g^{-1}$ for every $x \in G$. Now let $G = F(a_1, \dots, a_k)$ and
let $H = \mathrm{Inn}(F) \le \mathrm{Aut}(F)$ be the (non-amenable!) group of inner
automorphisms of $F$ with the generating set
$T = \{\mathrm{ad}(a_1), \dots, \mathrm{ad}(a_k)\}$. Then for any product $\phi_n$ of $n$
elements of $T$ we have $\|\phi_n\| \le 2n + 1$. Since
$\lim_{n\to\infty} \frac{\log(2n+1)}{n} = 0$, we see that $\nu(H, T, S) = 0$. Nevertheless,
in some instances quotient group considerations still imply that $\nu(H) > 0$ even
if $G$ does not have polynomial growth, or, equivalently, $G$ is not virtually
nilpotent.

We obtain Corollary C from the Introduction:

Corollary 2.9. Let $F$ be a free group of finite rank $k > 1$ and let
$H \le \mathrm{Aut}(F)$ be a finitely generated group of automorphisms of $F$ such that
the image $H'$ of $H$ in $\mathrm{Aut}(F_{ab}) = GL(k, \mathbb{Z})$ is non-amenable. Then
for any finite generating set $S$ of $F$ and for any finite generating set $T$ of $H$
we have $\nu(H, T, S) > 0$.

Proof. Let $S'$ be the image of $S$ in the abelianization $\mathbb{Z}^k = F_{ab}$ of $F$.
For any $\phi \in \mathrm{Aut}(F)$ the automorphism $\phi$ of $F$ factors through to an
automorphism $\phi'$ of $F_{ab}$. Clearly $\|\phi\|_S \ge \|\phi'\|_{S'}$. Hence
$\nu(H, T, S) \ge \nu(H', T', S')$, where $T'$ is the image of $T$ in
$\mathrm{Aut}(F_{ab})$. Since $F_{ab}$ has polynomial growth, by Theorem 2.7 we have
$\nu(H', T', S') > 0$ and hence $\nu(H, T, S) > 0$. □

3. Frequencies and cyclic words 

The following convention is fixed until the end of the paper unless specified 
otherwise. 

Convention 3.1. As before, let $k \ge 2$ and let $F = F(A)$ where
$A = \{a_1, \dots, a_k\}$. Let $\mathcal{A} = A^{\pm 1}$. We denote by $\mathcal{CR}$ the
set of all cyclically reduced words in $F$.

A cyclic word is an equivalence class of nontrivial cyclically reduced words, where
two nontrivial cyclically reduced words are equivalent if they are cyclic
permutations of each other. If $v$ is a cyclically reduced word, we denote by $(v)$
the cyclic word defined by $v$. Recall that if $u$ is a freely reduced word, we denote
the length of $u$ by $|u|$ and the length of the cyclically reduced form of $u$ by
$\|u\|$. If $w = (v)$ is a cyclic word then $\|w\| = \|v\|$ is the length of $w$.

Note that the set of cyclic words is naturally identified with the set of nontrivial 
conjugacy classes of elements of F. 

Definition 3.2 (Frequencies). Let w be a cyclic word. 

Let $v$ be a nontrivial freely reduced word. We define $n_w(v)$, the number of
occurrences of $v$ in $w$, as follows. Let $w = (z)$. Take the smallest $p > 0$ such
that $|z^{p-1}| \ge 2|v|$ and count the number of those $i$, $0 \le i < \|w\|$, such that
$z^p = z_1 v z_2$ where $|z_1| = i$. This number by definition is $n_w(v)$. If $v = 1$, we
define $n_w(1) := \|w\|$.

There is a more graphical way of defining $n_w(v)$ for a nontrivial cyclic word $w$.
We will think of $w$ as a cyclically reduced word written on a circle in a clockwise
direction without specifying a base-point. Then $n_w(v)$ is the number of positions
on the circle, starting from which it is possible to read the word $v$ going
clockwise along the circle (and wrapping around more than once, if necessary).

For any freely reduced word $v$ we define the frequency of $v$ in $w$ as:

\[ f_w(v) = \frac{n_w(v)}{\|w\|}. \]

Also, if $w$ is a nontrivial freely reduced word, and $v$ is another nontrivial freely
reduced word, we define $n_w(v)$, the number of occurrences of $v$ in $w$, as follows.
If $|w| = n > 0$ then by definition $n_w(v)$ is the number of those $i$, $0 \le i < n$, for
which $w$ decomposes as a freely reduced product $w = w' v w''$ with $|w'| = i$. Thus if
$|v| > |w|$ then necessarily $n_w(v) = 0$ (unlike the situation when $w$ is a cyclic word).
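The two counting conventions can be sketched in a few lines. This is an illustrative implementation under assumed conventions (words are Python strings, upper-case letters stand for inverse generators, and the inputs are assumed to be already freely or cyclically reduced); the function names are hypothetical.

```python
from fractions import Fraction

def cyclic_occurrences(z, v):
    """n_w(v) for the cyclic word w = (z), with z cyclically reduced.

    Reads v clockwise from each of the ||w|| positions on the circle,
    wrapping around as often as needed. By convention n_w(1) = ||w||.
    """
    n = len(z)
    if not v:
        return n
    return sum(
        all(z[(i + j) % n] == v[j] for j in range(len(v)))
        for i in range(n)
    )

def cyclic_frequency(z, v):
    """f_w(v) = n_w(v) / ||w|| as an exact rational number."""
    return Fraction(cyclic_occurrences(z, v), len(z))

def linear_occurrences(w, v):
    """n_w(v) for a freely reduced (non-cyclic) word w and nontrivial v."""
    return sum(w[i:i + len(v)] == v for i in range(len(w)))

print(cyclic_occurrences("ab", "aba"))   # v is longer than z but still occurs
print(linear_occurrences("ab", "aba"))   # no occurrence in the linear word
```

The last two lines illustrate the remark closing the definition: a word longer than $w$ may occur in the cyclic word but never in the linear one.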

Lemma 3.3. Let w be a nontrivial cyclic word. Then: 

(1) For any $m \ge 0$ and for any freely reduced word $u$ with $|u| = m$ we have:

\[ n_w(u) = \sum_{x \in A^{\pm 1},\ |ux| = |u|+1} n_w(ux) = \sum_{x \in A^{\pm 1},\ |xu| = |u|+1} n_w(xu), \]

and

\[ f_w(u) = \sum_{x \in A^{\pm 1},\ |ux| = |u|+1} f_w(ux) = \sum_{x \in A^{\pm 1},\ |xu| = |u|+1} f_w(xu). \]

(2) For any $m \ge 1$

\[ \sum_{|u| = m} n_w(u) = \|w\| \quad \text{and} \quad \sum_{|u| = m} f_w(u) = 1. \]

(3) For any $s > 0$ and any $u \in F$

\[ n_{w^s}(u) = s\, n_w(u) \quad \text{and} \quad f_{w^s}(u) = f_w(u). \]

Proof. Parts (1) and (3) are obvious. We establish (2) by induction on $m$. For
$m = 1$ the statement is clear. Suppose that $m > 1$ and that (2) has been established
for $m - 1$. We have:

\[ \sum_{|u| = m} n_w(u) = \sum_{|v| = m-1,\ x \in A^{\pm 1},\ |vx| = m} n_w(vx) = \sum_{|v| = m-1} n_w(v) = \|w\|, \]

as required. □
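The identities of Lemma 3.3 are easy to test on a concrete cyclic word. The sketch below is illustrative only: it fixes $k = 2$, encodes inverses as upper-case letters, and uses an arbitrarily chosen cyclically reduced word `z`; none of these choices come from the source.

```python
from itertools import product

LETTERS = "abAB"  # assumed encoding of A^{+-1} for k = 2

def inv(x):
    return x.swapcase()

def reduced_words(m):
    """All freely reduced words of length m over LETTERS."""
    return [
        "".join(w) for w in product(LETTERS, repeat=m)
        if all(w[i] != inv(w[i + 1]) for i in range(m - 1))
    ]

def n_cyc(z, v):
    """Number of occurrences of v in the cyclic word (z)."""
    n = len(z)
    if not v:
        return n
    return sum(all(z[(i + j) % n] == v[j] for j in range(len(v)))
               for i in range(n))

z = "aabAbbaB"  # sample cyclically reduced word (arbitrary choice)

def check_lemma(z, max_m=3, s=3):
    for m in range(max_m + 1):
        for u in reduced_words(m):
            right = sum(n_cyc(z, u + x) for x in LETTERS
                        if not u or x != inv(u[-1]))
            left = sum(n_cyc(z, x + u) for x in LETTERS
                       if not u or x != inv(u[0]))
            if not (n_cyc(z, u) == right == left):       # part (1)
                return False
            if n_cyc(z * s, u) != s * n_cyc(z, u):       # part (3)
                return False
        if m >= 1 and sum(n_cyc(z, u) for u in reduced_words(m)) != len(z):
            return False                                  # part (2)
    return True

print(check_lemma(z))
```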

Definition 3.4 (Nielsen automorphisms). A Nielsen automorphism of $F$ is an
automorphism $\tau$ of one of the following types:

(1) There is some $i$, $1 \le i \le k$, such that $\tau(a_i) = a_i^{-1}$ and
$\tau(a_j) = a_j$ for all $j \ne i$.

(2) There are some $1 \le i < j \le k$ such that $\tau(a_i) = a_j$, $\tau(a_j) = a_i$ and
$\tau(a_l) = a_l$ when $l \ne i$, $l \ne j$.

(3) There are some $1 \le i < j \le k$ such that $\tau(a_i) = a_i a_j$ and
$\tau(a_l) = a_l$ for $l \ne i$.

It is a classical fact that the set of all Nielsen automorphisms generates Aut(F). 
The following proposition proved by Kapovich in |3(J| is crucial for our arguments. 

Proposition 3.5. Let $\phi \in \mathrm{Out}(F)$ be an outer automorphism and let
$p \ge 0$ be such that $\phi$ can be represented, modulo $\mathrm{Inn}(F)$, as a product
of $p$ Nielsen automorphisms.

Then for any freely reduced word $v \in F$ with $|v| = m$ there exists a collection of
computable integers $c(u, v) = c(u, v, \phi) \ge 0$, where $u \in F$, $|u| = 8^p m$, such
that for any nontrivial cyclic word $w$ we have

\[ n_{\phi(w)}(v) = \sum_{|u| = 8^p m} c(u, v)\, n_w(u). \]

Corollary 3.6. Let $\phi$ be an automorphism of $F$ and let $p$ be such that $\phi$ can be
written as a product of $p$ Nielsen automorphisms.

There exists a collection of integers $e(v) = e(v, \phi) \ge 0$, where $v \in F$,
$|v| = 8^p$, such that for any cyclic word $w$ we have:

\[ \|\phi(w)\| = \sum_{|v| = 8^p} e(v)\, n_w(v) \]

and

\[ \frac{\|\phi(w)\|}{\|w\|} = \sum_{|v| = 8^p} e(v)\, f_w(v). \]

Moreover, there is an algorithm which, given $\phi$ and $v$, computes the numbers
$e(v)$.

Proof. Since $\|\phi(w)\| = \sum_{x \in A^{\pm 1}} n_{\phi(w)}(x)$, the statement follows
directly from Proposition 3.5. □

The following well-known fact is a version of the so-called "Bounded Cancellation
Lemma" (see [16]):

Lemma 3.7. Let $\alpha$ be an injective endomorphism of $F$. There is
$N = N(\alpha) > 0$ such that for any cyclically reduced word $w$ the maximal terminal
segment of $\alpha(w)$ that cancels in the product $\alpha(w) \cdot \alpha(w)$ has length
at most $N$.

4. Actions on trees 

Let $\Gamma$ be a finite connected graph with orientation
$E\Gamma = E^+\Gamma \sqcup E^-\Gamma$. For $e \in E\Gamma$ we use the following notation.
The inverse edge of $e$ is denoted by $\bar{e}$, $o(e)$ denotes the initial vertex of
$e$ and $t(e)$ denotes the terminal vertex of $e$.

Let $F$ be a free group and let $\phi : F \to \pi_1(\Gamma, p)$ be an isomorphism between
$F$ and the fundamental group of $\Gamma$ with respect to a vertex $p$. Let $T$ be a
maximal tree in $\Gamma$. For any vertex $v$, let $[p, v]_T$ denote the path in $T$ from $p$
to $v$. The choice of $T$ defines a basis $S_T$ of $\pi_1(\Gamma, p)$ as follows:

\[ S_T := \{\, [p, o(e)]_T \, e \, [t(e), p]_T : e \in E^+(\Gamma - T) \,\}. \]

The $\phi$-pullback of this basis, $B_T := \phi^{-1}(S_T)$, is a basis of $F$ referred to as
the geometric basis of $F$ determined by $T$.

Let $s_e := [p, o(e)]_T \, e \, [t(e), p]_T$ where $e \in E(\Gamma - T)$, so that
$s_{\bar{e}} = s_e^{-1}$. Let $b_e = \phi^{-1}(s_e)$, where $e \in E(\Gamma - T)$, so that again
$b_{\bar{e}} = b_e^{-1}$.

The following obvious lemma indicates the explicit correspondence between freely
reduced words in $S_T$ (or $B_T$) and reduced edge-paths in $\Gamma$.

Lemma 4.1. (1) Let $\gamma$ be an edge-path in $\Gamma$ from $p$ to $p$. Let $u$ be a word in
$S_T^{\pm 1}$ constructed from $\gamma$ as follows: delete all the edges of $T$ from $\gamma$ and
replace each edge $e \in E^+(\Gamma - T)$ in $\gamma$ by $s_e$ and each edge
$e \in E^-(\Gamma - T)$ in $\gamma$ by $s_{\bar{e}}^{-1}$. Then $u = \gamma$ in $\pi_1(\Gamma, p)$ and
$u$ is a reduced word in $S_T^{\pm 1}$ if and only if $\gamma$ is a reduced path.

(2) Let $u$ be a word in $S_T \cup S_T^{-1}$.

Construct the path $\gamma$ from $p$ to $p$ as follows. First for each
$e \in E^+(\Gamma - T)$ replace every $s_e$ in $u$ by $e$ and replace every $s_e^{-1}$ by
$\bar{e}$. Then between every two consecutive $e, e'$ insert the path
$[t(e), o(e')]_T$. Finally append the path $[p, o(e_0)]_T$ in front, for the first edge
$e_0 \in E(\Gamma - T)$ of the resulting sequence, and append the path $[t(e'), p]_T$ at
the end for the last edge $e' \in E(\Gamma - T)$ of the sequence.

Then $\gamma$ is a path from $p$ to $p$ that is equal to $u$ in $\pi_1(\Gamma, p)$ and that is
reduced if and only if the word $u$ over $S_T$ is reduced.

(3) Let $\gamma$ be a closed edge-path in $\Gamma$. Let $u$ be a word in $S_T^{\pm 1}$ obtained
from $\gamma$ as in (1). Then the loop at $p$ corresponding to $u$ in $\pi_1(\Gamma, p)$ is
freely homotopic to $\gamma$ in $\Gamma$ and the word $u$ is cyclically reduced if and only if
the path $\gamma$ is cyclically reduced.

(4) Let $w$ be a cyclic word in $S_T^{\pm 1}$. Let $\gamma$ be a circuit in $\Gamma$ obtained as
follows. Replace each occurrence of $s_e$ in $w$ by $e$ and each occurrence of
$s_e^{-1}$ by $\bar{e}$; after that between each two consecutive (in the cyclic order)
edges $e, e'$ insert the path $[t(e), o(e')]_T$. Then $w$ and $\gamma$ represent freely
homotopic loops in $\Gamma$ and the cyclic word $w$ is reduced if and only if the
circuit $\gamma$ is reduced.

Now suppose that $\Gamma$ is endowed with the structure of a metric graph, that is,
each edge $e$ of $\Gamma$ is assigned a length $\ell(e) > 0$ in such a way that
$\ell(e) = \ell(\bar{e})$ for each edge $e$. Let $X = \widetilde{(\Gamma, p)}$ be the
universal cover of $\Gamma$. Then $X$ inherits the structure of a metric tree with an
isometric action of $\pi_1(\Gamma, p)$ and, via $\phi$, an action of $F$ on $X$.

Let $\tilde{p}$ be a lift of $p$ to $X$. For $g \in \pi_1(\Gamma, p)$ let
$|g|_p := d_X(\tilde{p}, g\tilde{p})$. Also denote by $\|g\|_X$ the translation length of
$g$ when acting on $X$. Similarly, if $w$ is a conjugacy class (or a cyclic word) in
$\pi_1(\Gamma, p)$, we denote by $\|w\|_X$ the translation length of $w$ with respect to
$X$. For each freely reduced word $z = s_e^{\epsilon} s_{e'}^{\delta}$ of length two in
$S_T^{\pm 1}$, where $\epsilon, \delta \in \{1, -1\}$, denote by $r_z$ the length of the
edge-path $[t(e^{\epsilon}), o(e'^{\delta})]_T$ in $\Gamma$. Let $Z$ be the set of all freely
reduced words of length two in $S_T^{\pm 1}$. For each $a = s_e \in S_T$ denote
$e(a) := e$ and $e(a^{-1}) := \bar{e}$.

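Lemma 4.2. (1) Let $w$ be a reduced cyclic word in $S_T^{\pm 1}$. Then

\[ \|w\|_X = \sum_{a \in S_T^{\pm 1}} \ell(e(a))\, n_w(a) + \sum_{z \in Z} r_z\, n_w(z). \]

(2) Let $u$ be a freely reduced word in $S_T^{\pm 1}$. Then

\[ |u|_p = \sum_{a \in S_T^{\pm 1}} \ell(e(a))\, n_u(a) + \sum_{z \in Z} r_z\, n_u(z) + \ell([p, o(e)]_T) + \ell([t(e'), p]_T), \]

where $e$ and $e'$ are the first and the last edges of $\Gamma - T$ in the path
$\gamma(u)$, respectively.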

The following is Theorem D from the Introduction:

Theorem 4.3. Let $F = F(a_1, \dots, a_k)$, where $k \ge 2$, and let
$A = \{a_1, \dots, a_k\}$. Then for any free action $\phi$ of $F$ on a simplicial tree $X$
without inversions the generic stretching factor $\lambda(\phi) = \lambda_A(\phi)$ is a
rational number with

\[ 2k\lambda(\phi) \in \mathbb{Z}\!\left[\tfrac{1}{2k-1}\right]. \]

Moreover, if $X$ is given as the universal cover of a finite connected simplicial
graph $\Gamma$ and if the action $\phi$ is given via an explicit isomorphism between $F$
and $\pi_1(\Gamma, p)$, then $\lambda(\phi)$ is algorithmically computable in terms of $\phi$.

Proof. Recall that the definition of $\lambda(\phi, |\cdot|_x)$ does not depend on the
choice of a point $x \in X$. Hence we may assume that $x$ is a vertex of the minimal
$F$-invariant subtree of $X$ and therefore, that the action of $F$ on $X$ is minimal.
Let $\Gamma = X/F$ be the finite quotient graph. Choose an orientation on $\Gamma$ and a
maximal tree $T$ in $\Gamma$. Choose a base-vertex $p$ in $\Gamma$ to be the image of
$x \in X$ in $\Gamma$. Note that in both $X$ and $\Gamma$ every edge has length 1. Then there
is a canonical isomorphism $\psi : F \to \pi_1(\Gamma, p)$. Let $S_T$ and $B_T$ be the
geometric bases corresponding to $T$ for $\pi_1(\Gamma, p)$ and $F$ accordingly.

Fix a bijection between $B_T$ and $A = \{a_1, \dots, a_k\}$ and an automorphism $\alpha$
of $F$ induced by this bijection of the two free bases of $F$.

Let $g = x_1 \cdots x_n \in F$ be a freely reduced word of length $n$ in
$F(a_1, \dots, a_k)$. Let $g'$ be a cyclically reduced word of length $n$ over $A$
obtained from $g$ by changing the last letter of $g$ if necessary. Thus
$|g' g^{-1}|_A \le 2$.

Let $w'$ be the cyclic word over $A$ defined by $g'$. Let $w$ be the result of rewriting
$w'$ as the cyclic word in $B_T$. Then there is an integer $M \ge 1$ such that for each
freely reduced word $z$ in $B_T$ of length at most 2

\[ n_w(z) = \sum_{|u|_A = M} c(u, z)\, n_{w'}(u), \]

where $c(u, z) \ge 0$ are some integers independent of $w$. Let $Z_i$ be the set of
freely reduced words of length $i$ in $B_T$, for $i = 1, 2$. Then

\[ \|g'\|_X = \|w\|_X = \sum_{a \in Z_1} \ell(e(a))\, n_w(a) + \sum_{z \in Z_2} r_z\, n_w(z)
   = \sum_{a \in Z_1} \sum_{|u|_A = M} \ell(e(a))\, c(u, a)\, n_{w'}(u) + \sum_{z \in Z_2} \sum_{|u|_A = M} r_z\, c(u, z)\, n_{w'}(u). \]

It follows from Lemma 4.1 and Lemma 4.2 that if $h \in F$ is cyclically reduced over
$B_T$ then $\big|\, \|h\|_X - |h|_x \big| \le N$ where $N = N(x) > 0$ is some constant
independent of $h$. On the other hand by the Bounded Cancellation Lemma (Lemma 3.7)
there exists a constant $N' > 0$ such that for any cyclically reduced word $y$ over
$A$, we have $\big|\, \|y\|_{B_T} - |y|_{B_T} \big| \le N'$. By construction $g'$ is
cyclically reduced over $A$ and $|g' g^{-1}|_A \le 2$. Hence there exists a constant
$L > 0$ such that for every $g$ as above and each $u \in F$ with $|u|_A = M$ we have
$\big|\, |g|_x - \|g'\|_X \big| \le L$ and $|n_g(u) - n_{w'}(u)| \le L$.

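Therefore there is another constant $L' > 0$ independent of $g$ such that for every
freely reduced word $g$ of length $n$ over $A$

\[ (*) \qquad \left| \sum_{a \in Z_1} \sum_{|u|_A = M} \ell(e(a))\, c(u, a)\, f_g(u) + \sum_{z \in Z_2} \sum_{|u|_A = M} r_z\, c(u, z)\, f_g(u) - \frac{|g|_x}{n} \right| \le \frac{L'}{n}. \]

If $g_n$ is a freely reduced word of length $n$ obtained by a non-backtracking simple
random walk of length $n$ on $F(a_1, \dots, a_k)$ then for each
$u \in F(a_1, \dots, a_k)$ with $|u|_A = M$ we have

\[ \lim_{n\to\infty} f_{g_n}(u) = \frac{1}{2k(2k-1)^{M-1}} \]

almost surely. Therefore $(*)$ implies that

\[ (**) \qquad \lambda(\phi) = \frac{1}{2k(2k-1)^{M-1}} \left( \sum_{a \in Z_1} \sum_{|u|_A = M} \ell(e(a))\, c(u, a) + \sum_{z \in Z_2} \sum_{|u|_A = M} r_z\, c(u, z) \right). \]

Since $\ell(e(a)) = 1$, $c(u, a)$, $c(u, z)$ and $r_z$ are integers, it follows that
$\lambda(\phi)$ is rational and, moreover, that

\[ 2k\lambda(\phi) \in \mathbb{Z}\!\left[\tfrac{1}{2k-1}\right]. \]

Moreover, $\lambda(\phi)$ is computable in terms of an explicit isomorphism between $F$
and $\pi_1(\Gamma, p)$. □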
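The arithmetic conclusion $2k\lambda(\phi) \in \mathbb{Z}[\tfrac{1}{2k-1}]$ says that the denominator of $2k\lambda(\phi)$ in lowest terms has no prime factor outside those of $2k-1$. The helper below is a hypothetical sketch of such a membership test using exact rational arithmetic; the sample value $7/6$ with $k = 2$ is chosen purely for illustration (for $k = 2$ it equals $1 + \frac{2k-3}{2k^2-k}$, the gap endpoint mentioned in the abstract).

```python
from fractions import Fraction
from math import gcd

def in_localization(q, base):
    """Return True if the rational q lies in Z[1/base].

    Equivalently: the reduced denominator of q divides some power of base.
    """
    d = Fraction(q).denominator
    g = gcd(d, base)
    while g > 1:
        while d % g == 0:   # strip every factor shared with base
            d //= g
        g = gcd(d, base)
    return d == 1

k = 2
lam = Fraction(7, 6)               # illustrative value only
print(2 * k * lam, in_localization(2 * k * lam, 2 * k - 1))
print(lam, in_localization(lam, 2 * k - 1))
```

Here $2k\lambda = 14/3$ passes the test while $\lambda = 7/6$ itself does not, which shows why the factor $2k$ appears in the statement.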
Remark 4.4. The formula $(**)$ for $\lambda(\phi)$ holds for an arbitrary structure of a
metric graph on $\Gamma$, where the lengths of edges are allowed to be arbitrary
positive real numbers and not necessarily 1. If the lengths of all edges of $\Gamma$
are rational, then by $(**)$ $\lambda(\phi)$ is also rational. Moreover, if the lengths of
the edges are given to us in some algorithmically describable form then
$\lambda(\phi)$ is computable in terms of these lengths and of an explicit isomorphism
between $F$ and $\pi_1(\Gamma, p)$.

5. Genericity 

Convention 5.1. Recall that $\mathcal{CR}$ denotes the set of all cyclically reduced
words in $F = F(a_1, \dots, a_k)$. If $S \subseteq F$ and $n \ge 0$ we denote

\[ \rho(n, S) := \#\{w \in S : |w| \le n\} \]

and

\[ \gamma(n, S) := \#\{w \in S : |w| = n\}. \]

Let $P_n$ be the uniform discrete probability measure on the set of all elements
$w \in F$ with $|w| = n$. We extend $P_n$ to $F$ by setting $P_n(w) = 0$ for any $w \in F$
with $|w| \ne n$.

Similarly, let $P'_n$ be the uniform discrete probability measure on the set of all
cyclically reduced elements $w \in F$ with $\|w\| = n$. We extend $P'_n$ to
$\mathcal{CR}$ by setting $P'_n(w) = 0$ for any $w \in \mathcal{CR}$ with $\|w\| \ne n$.

Thus $\gamma(n, F) = 2k(2k-1)^{n-1}$ for $n > 0$.

For a number sequence $x_n$ with $\lim_{n\to\infty} x_n = x \in \mathbb{R}$ we say that the
convergence is exponentially fast if there exist $0 < \sigma < 1$ and $D > 0$ such that
for all $n \ge 1$ we have $|x_n - x| \le D\sigma^n$.
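The count $\gamma(n, F) = 2k(2k-1)^{n-1}$ can be confirmed by brute force for small $n$. The sketch below uses an assumed integer encoding of letters ($i$ for $a_i$, $-i$ for its inverse) and simply enumerates all words; it is meant as a sanity check, not as an efficient method.

```python
from itertools import product

def sphere_size(n, k):
    """gamma(n, F): number of freely reduced words of length n in F_k."""
    letters = [s * i for i in range(1, k + 1) for s in (1, -1)]
    return sum(
        1 for w in product(letters, repeat=n)
        if all(w[i] != -w[i + 1] for i in range(n - 1))
    )

def ball_size(n, k):
    """rho(n, F): number of freely reduced words of length at most n."""
    return sum(sphere_size(m, k) for m in range(n + 1))

k = 2
for n in range(1, 6):
    print(n, sphere_size(n, k), 2 * k * (2 * k - 1) ** (n - 1))
```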

Definition 5.2 (Genericity). Let $S \subseteq W \subseteq F$. We say that $S$ is
exponentially $W$-generic if

\[ \lim_{n\to\infty} \frac{\gamma(n, S)}{\gamma(n, W)} = 1 \]

and the convergence is exponentially fast. The complement in $W$ of an exponentially
$W$-generic set is called exponentially $W$-negligible.

In practice we are only interested in the cases $W = F$ and $W = \mathcal{CR}$, the
set of all cyclically reduced words in $F$. By definition $S \subseteq F$ is
exponentially $F$-generic if and only if $\lim_{n\to\infty} P_n(S) = 1$ with
exponentially fast convergence in this limit. Similarly $S \subseteq \mathcal{CR}$ is
exponentially $\mathcal{CR}$-generic if and only if $\lim_{n\to\infty} P'_n(S) = 1$ with
exponentially fast convergence. Here there is a simple criterion of being
exponentially negligible [35] in $F$ and $\mathcal{CR}$:

Lemma 5.3. Let $W = F$ or $W = \mathcal{CR}$. Then for a subset $S \subseteq W$ the
following are equivalent:

(1) The set $S$ is exponentially $W$-negligible.

(2) We have

\[ \frac{\rho(n, S)}{(2k-1)^n} \to_{n\to\infty} 0 \]

exponentially fast.

(3) We have

\[ \frac{\gamma(n, S)}{(2k-1)^n} \to_{n\to\infty} 0 \]

exponentially fast.

(4) We have

\[ \limsup_{n\to\infty} \sqrt[n]{\rho(n, S)} < 2k - 1. \]

(5) We have

\[ \limsup_{n\to\infty} \sqrt[n]{\gamma(n, S)} < 2k - 1. \]

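Proposition 5.4. Let $\epsilon > 0$ and let $m > 0$ be an integer. Then the set

\[ W(m, \epsilon) := \left\{ w \in F : \text{for any } u \ne 1 \text{ with } |u| = m \text{ we have } \left| f_w(u) - \frac{1}{2k(2k-1)^{m-1}} \right| < \epsilon \right\} \]

is exponentially $F$-generic.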

Proof. This is a straightforward corollary of Large Deviation Theory |18j applied 
to the finite state Markov chain generating the freely reduced words in F. We refer 
the reader to for a more detailed discussion about large Deviation Theory and 
how it works in this particular case. □ 
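The concentration of frequencies around $\frac{1}{2k(2k-1)^{m-1}}$ is visible already in a single long random word. The following sketch is an illustration under stated assumptions: letters are encoded as nonzero integers, the word is produced by a non-backtracking walk, and the frequency of $u$ in a linear word $w$ is taken to be $n_w(u)/|w|$.

```python
import random
from collections import Counter
from itertools import product

def random_reduced_word(n, k, seed=0):
    """Uniformly random freely reduced word of length n in F_k."""
    rng = random.Random(seed)
    letters = [s * i for i in range(1, k + 1) for s in (1, -1)]
    word = []
    for _ in range(n):
        word.append(rng.choice([x for x in letters
                                if not word or x != -word[-1]]))
    return word

def max_frequency_deviation(word, k, m):
    """Largest |f_w(u) - 1/(2k(2k-1)^(m-1))| over reduced u with |u| = m."""
    n = len(word)
    counts = Counter(tuple(word[i:i + m]) for i in range(n - m + 1))
    letters = [s * i for i in range(1, k + 1) for s in (1, -1)]
    expected = 1 / (2 * k * (2 * k - 1) ** (m - 1))
    reduced = [u for u in product(letters, repeat=m)
               if all(u[i] != -u[i + 1] for i in range(m - 1))]
    return max(abs(counts[u] / n - expected) for u in reduced)

k, m, n = 2, 2, 200_000
deviation = max_frequency_deviation(random_reduced_word(n, k), k, m)
print(deviation)
```

Increasing `n` should shrink the printed deviation, in line with the exponential genericity of $W(m, \epsilon)$ for every fixed $\epsilon$.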

It is not hard to deduce the following from Proposition 5.4:

Proposition 5.5. Let $\epsilon > 0$ and let $m > 0$ be an integer. Then the set

\[ C(m, \epsilon) := \left\{ w \in \mathcal{CR} : \text{for any } u \ne 1 \text{ with } |u| = m \text{ and for the cyclic word } (w) \text{ we have } \left| f_{(w)}(u) - \frac{1}{2k(2k-1)^{m-1}} \right| < \epsilon \right\} \]

is exponentially $\mathcal{CR}$-generic.

We now give the definition of an "approximate" stretching factor, which will 
later be seen to be equivalent to the generic stretching factor of an automorphism 
introduced earlier. 

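Definition 5.6. Let $\phi : F \to \mathrm{Aut}(X)$ be a free simplicial action without
inversions of $F = F(a_1, \dots, a_k)$ on a simplicial tree $X$.

We say that a number $\lambda \ge 0$ is an approximate stretching factor of $\phi$ if for
every $p \in X$ and for any $\epsilon > 0$ the set

\[ \left\{ w \in F : \left| \frac{|w|_p}{|w|} - \lambda \right| < \epsilon \right\} \]

is exponentially generic in $F$.

Similarly, we say that a number $\lambda \ge 0$ is an approximate conjugacy stretching
factor of $\phi$ if for any $\epsilon > 0$ the set

\[ \left\{ w \in \mathcal{CR} : \left| \frac{\|w\|_X}{\|w\|} - \lambda \right| < \epsilon \right\} \]

is exponentially generic in $\mathcal{CR}$.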

Proposition 5.7. Let $\phi : F \to \mathrm{Aut}(X)$ be a free simplicial action of
$F = F(a_1, \dots, a_k)$ on a simplicial tree $X$.

(1) There is at most one approximate stretching factor for $\phi$.

(2) There is at most one approximate conjugacy stretching factor for $\phi$.

(3) If $\lambda$ is an approximate conjugacy stretching factor for $\phi$ then $\lambda$ is
also an approximate stretching factor for $\phi$.

(4) If $\lambda$ is an approximate stretching factor for $\phi$ then $\lambda$ is also an
approximate conjugacy stretching factor for $\phi$.

Proof. Parts (1) and (2) are obvious.

We now establish (3). Indeed, suppose that $\lambda$ is an approximate conjugacy
stretching factor for $\phi$. Let $\epsilon > 0$ and let $S$ be the set of all cyclically
reduced words $w$ such that $\left| \frac{\|w\|_X}{\|w\|} - \lambda \right| \ge \epsilon/2$.
Then $S$ is exponentially $\mathcal{CR}$-negligible, so that

\[ \frac{\gamma(n, S)}{(2k-1)^n} \to_{n\to\infty} 0 \]

exponentially fast. Let $p \in X$. Put $M = \max\{|a_i|_p : 1 \le i \le k\}$. Let $N > 0$ be
an integer such that for any cyclically reduced word $u$ we have
$\big|\, \|u\|_X - |u|_p \big| \le N$.

Let $S'$ be the set of all freely reduced words $w$ in $F$ that differ from an element
of $S$ in at most the last letter. Then $\gamma(n, S') \le 2k\,\gamma(n, S)$ and hence $S'$
is exponentially $F$-negligible by Lemma 5.3.

Suppose $w \in F - S'$ is such that $\frac{N + 2M}{|w|} < \frac{\epsilon}{2}$. Let $u$ be a
cyclically reduced word obtained from $w$ by changing at most the last letter. Then
$|u| = |w|$ and $u \in \mathcal{CR} - S$.

Thus $d_A(w, u) \le 2$ and hence $d_X(wp, up) \le 2M$. Thus
$\big|\, |u|_p - |w|_p \big| \le 2M$. Also $\big|\, \|u\|_X - |u|_p \big| \le N$. Therefore
$\big|\, \|u\|_X - |w|_p \big| \le N + 2M$. Since $u \in \mathcal{CR} - S$, we have
$\big|\, \|u\|_X - \lambda\|u\| \big| < \frac{\epsilon}{2}\|u\|$. Since
$\|u\| = |u| = |w|$, we have:

\[ \big|\, |w|_p - \lambda|w| \,\big| \le \frac{\epsilon}{2}|w| + N + 2M \]

and

\[ \left| \frac{|w|_p}{|w|} - \lambda \right| \le \frac{\epsilon}{2} + \frac{N + 2M}{|w|} < \epsilon. \]

The set $\left\{ w \in F : \frac{N + 2M}{|w|} \ge \frac{\epsilon}{2} \right\}$ is finite. Hence
$S' \cup \left\{ w \in F : \frac{N + 2M}{|w|} \ge \frac{\epsilon}{2} \right\}$ is
exponentially $F$-negligible and assertion (3) holds.

The proof of (4) is similar to that of (3) and we leave the details to the reader. □ 

Theorem 5.8. Let $F = F(a_1, \dots, a_k)$ and let $\phi : F \to \mathrm{Aut}(X)$ be a free
simplicial action of $F$ on a simplicial tree $X$.

Then the generic stretching factor $\lambda(\phi)$ is also an approximate conjugacy
stretching factor (and thus by Proposition 5.7 an approximate stretching factor).

Proof. The proof is very similar to that of Theorem 4.3. Since the minimal
$F$-invariant subtree of $X$ contains the axes of all the nontrivial elements of $F$,
we may again assume that the action of $F$ on $X$ is minimal.

Choose a vertex $x \in X$. Recall that, using the notations from the proof of
Theorem 4.3, for any $w \in F$ with $|w| = n$

\[ (!) \qquad \left| \sum_{a \in Z_1} \sum_{|u|_A = M} c(u, a)\, f_w(u) + \sum_{z \in Z_2} \sum_{|u|_A = M} r_z\, c(u, z)\, f_w(u) - \frac{|w|_x}{n} \right| \le \frac{L'}{n}. \]

It follows from Lemma 4.1 and Lemma 4.2 that if $w \in F$ is cyclically reduced over
$B_T$ then $\big|\, \|w\|_X - |w|_x \big| \le N$ where $N = N(x) > 0$ is some constant
independent of $w$. On the other hand by the Bounded Cancellation Lemma (Lemma 3.7)
there exists a constant $N' > 0$ such that for any cyclically reduced word $w$ over
$A$, we have $\big|\, \|w\|_{B_T} - |w|_{B_T} \big| \le N'$. Hence for any cyclically
reduced word $w$ over $A$ we have $\big|\, \|w\|_X - |w|_x \big| \le N''$ where
$N'' = N''(x) > 0$ is some constant independent of $w$.

Therefore for any cyclically reduced $w \in F$ over $A$ with $\|w\| = n$

\[ (*) \qquad \left| \sum_{a \in Z_1} \sum_{|u|_A = M} c(u, a)\, f_w(u) + \sum_{z \in Z_2} \sum_{|u|_A = M} r_z\, c(u, z)\, f_w(u) - \frac{\|w\|_X}{n} \right| \le \frac{L' + N''}{n}. \]

Let $\epsilon > 0$. We know that the set

\[ C(M, \epsilon) := \left\{ w \in \mathcal{CR} : \text{for any } u \ne 1 \text{ with } |u| = M \text{ and for the cyclic word } (w) \text{ we have } \left| f_{(w)}(u) - \frac{1}{2k(2k-1)^{M-1}} \right| < \epsilon \right\} \]

is exponentially $\mathcal{CR}$-generic.

Hence $(*)$ implies that for any $w \in C(M, \epsilon)$ with $\|w\| = n$

\[ \left| \frac{\|w\|_X}{n} - \frac{1}{2k(2k-1)^{M-1}} \left( \sum_{a \in Z_1} \sum_{|u|_A = M} c(u, a) + \sum_{z \in Z_2} \sum_{|u|_A = M} r_z\, c(u, z) \right) \right| \le \frac{N_1}{n} + N_1 \epsilon \]

for some constant $N_1 > 0$ independent of $w$ and $\epsilon$. Thus by definition the
number

\[ \frac{1}{2k(2k-1)^{M-1}} \left( \sum_{a \in Z_1} \sum_{|u|_A = M} c(u, a) + \sum_{z \in Z_2} \sum_{|u|_A = M} r_z\, c(u, z) \right) \]

is an approximate conjugacy stretching factor for $\phi$. In the proof of Theorem 4.3
we obtained the same formula for $\lambda(\phi)$. □

Lemma 5.9. Let $F = F(a_1, \dots, a_k)$ and let $\phi : F \to \mathrm{Aut}(X)$ be a free
simplicial action of $F$ on a simplicial tree $X$. Let $\mu > 0$.

Suppose there exists an exponentially $\mathcal{CR}$-generic set $S$ such that for any
$w \in S$

\[ \frac{\|w\|_X}{\|w\|} \ge \mu. \]

Then $\lambda(\phi) \ge \mu$.

Proof. Suppose, on the contrary, that $\lambda(\phi) < \mu$. Choose $\epsilon > 0$ such that
$\lambda(\phi) + \epsilon < \mu$.

Then there is an exponentially $\mathcal{CR}$-generic set $R$ of cyclically reduced
words such that for any $w \in R$

\[ \frac{\|w\|_X}{\|w\|} \le \lambda(\phi) + \epsilon. \]

The intersection $S \cap R$ is exponentially $\mathcal{CR}$-generic and hence nonempty.
Take $w \in S \cap R$. Then

\[ \mu \le \frac{\|w\|_X}{\|w\|} \le \lambda(\phi) + \epsilon < \mu, \]

yielding a contradiction. □

6. Whitehead's Peak Reduction and rigidity of free group automorphisms

We need to recall some definitions related to Whitehead's algorithm for solving 
the automorphic equivalence problem in a free group. We refer the reader to (HHI I43j 
for a detailed exposition. 

Definition 6.1 (Whitehead automorphisms). A Whitehead automorphism of $F$ is
an automorphism $\tau$ of $F$ of one of the following two types:

(1) There is a permutation $t$ of $A^{\pm 1}$ such that $\tau|_{A^{\pm 1}} = t$. In this
case $\tau$ is called a relabeling automorphism or a Whitehead automorphism of the
first kind.

(2) There is an element $a \in A^{\pm 1}$, the multiplier, such that for any
$x \in A^{\pm 1}$

\[ \tau(x) \in \{x,\ xa,\ a^{-1}x,\ a^{-1}xa\}. \]

In this case we say that $\tau$ is a Whitehead automorphism of the second kind. (Note
that we always have $\tau(a) = a$ in this case since $\tau$ is an automorphism of $F$.) To
every such $\tau$ we associate a pair $(S, a)$ where $a$ is as above and $S$ consists of
all those elements of $A^{\pm 1}$, including $a$ but excluding $a^{-1}$, such that
$\tau(x) \in \{xa, a^{-1}xa\}$. We will say that $(S, a)$ is the characteristic pair of
$\tau$.

Note that for any $a \in A^{\pm 1}$ the inner automorphism $\mathrm{ad}(a)$ is a Whitehead
automorphism of the second kind.

The following important result of Whitehead is known as the "peak reduction 
lemma" : 

Proposition 6.2. Let $u, v$ be cyclic words with $\|u\| \le \|v\|$ and let
$\phi \in \mathrm{Aut}(F)$ be such that $\phi(u) = v$. Then we can write $\phi$ as a product
of Whitehead moves

\[ \phi = \tau_p \cdots \tau_1 \]

so that for each $i = 1, \dots, p$

\[ \|\tau_i \cdots \tau_1(u)\| \le \|v\|. \]

Moreover, if $\|u\| < \|v\|$ then the above inequalities are strict for all $i < p$.

Definition 6.3 (Weighted Whitehead graph). Let $w$ be a nontrivial cyclically
reduced word in $(A^{\pm 1})^*$. The weighted Whitehead graph $\Gamma_w$ of $w$ is defined
as follows. Let $(w)$ be the cyclic word defined by $w$. The vertex set of $\Gamma_w$ is
$A^{\pm 1}$. For every $x, y \in A^{\pm 1}$ such that $x \ne y^{-1}$ there is an undirected
edge in $\Gamma_w$ from $x^{-1}$ to $y$ labeled by the sum
$\hat{n}_w(xy) := n_{(w)}(xy) + n_{(w)}(y^{-1}x^{-1})$.

There are $k(2k-1)$ undirected edges in $\Gamma_w$. Edges may have label zero, but
there are no edges from $a$ to $a$ for $a \in A^{\pm 1}$. It is easy to see that we have
$\Gamma_w = \Gamma_v$ for any cyclic permutation $v$ of $w$ or $w^{-1}$.

Convention 6.4. Let $w$ be a fixed nontrivial cyclically reduced word. For two
subsets $X, Y \subseteq A^{\pm 1}$ we denote by $X.Y$ the sum of all edge-labels in the
weighted Whitehead graph $\Gamma_w$ of $w$ of edges from elements of $X$ to elements of
$Y$. Thus for $x \in A^{\pm 1}$ the number $x.A^{\pm 1}$ is equal to
$n_w(x) + n_w(x^{-1})$, the total number of occurrences of $x^{\pm 1}$ in $w$.



The next lemma, which is Proposition 4.16 of Ch. I in [2B|, gives an explicit 
formula for the difference of the lengths of $w$ and $\tau(w)$, where $\tau$ is a Whitehead
automorphism.

Lemma 6.5. Let $w$ be a nontrivial cyclically reduced word and let $\tau$ be a Whitehead
automorphism of the second kind with the characteristic pair $(S, a)$. Let
$S' = A^{\pm 1} - S$. Then

\[ \|\tau(w)\| - \|w\| = S.S' - a.A^{\pm 1}. \]

The following important notion was introduced by Kapovich, Schupp and Shpilrain
in [35].

Definition 6.6 (Strict Minimality). A nontrivial cyclically reduced word $w$ in $F$
is strictly minimal if for every non-inner Whitehead automorphism $\tau$ of $F$ of the
second kind we have

\[ \|\tau(w)\| > \|w\|. \]

The set of all strictly minimal elements in $F$ is denoted $SM$.
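Strict minimality can be tested by brute force, since there are only finitely many Whitehead automorphisms of the second kind. The sketch below is an illustration for $k = 2$ under assumed conventions: upper-case letters encode inverses, a characteristic pair $(S, a)$ acts by $x \mapsto a^{-[x^{-1} \in S]}\, x\, a^{[x \in S]}$ for $x \ne a^{\pm 1}$, and the non-inner automorphisms are taken to be those with $|S| \ge 2$ and $|S'| \ge 2$. All function names are hypothetical.

```python
import random
from itertools import combinations

LETTERS = "abAB"  # assumed encoding of A^{+-1} for k = 2

def inv(x):
    return x.swapcase()

def free_reduce(word):
    out = []
    for x in word:
        if out and out[-1] == inv(x):
            out.pop()
        else:
            out.append(x)
    return out

def cyclic_reduce(word):
    """Cyclically reduced form: freely reduce, then strip inverse ends."""
    w = free_reduce(word)
    i, j = 0, len(w)
    while j - i >= 2 and w[i] == inv(w[j - 1]):
        i, j = i + 1, j - 1
    return w[i:j]

def apply_whitehead(word, S, a):
    """Cyclically reduced image of word under the automorphism (S, a)."""
    out = []
    for x in word:
        if x in (a, inv(a)):
            out.append(x)
        else:
            out.extend(([inv(a)] if inv(x) in S else [])
                       + [x] + ([a] if x in S else []))
    return cyclic_reduce(out)

def non_inner_second_kind(letters=LETTERS):
    """Characteristic pairs (S, a) with |S| >= 2 and |S'| >= 2."""
    for a in letters:
        others = [x for x in letters if x not in (a, inv(a))]
        for r in range(1, len(others)):
            for extra in combinations(others, r):
                yield frozenset(extra) | {a}, a

def is_strictly_minimal(word):
    w = cyclic_reduce(word)
    return all(len(apply_whitehead(w, S, a)) > len(w)
               for S, a in non_inner_second_kind())

def random_cyclically_reduced(n, seed=0):
    rng = random.Random(seed)
    w = []
    for _ in range(n):
        w.append(rng.choice([x for x in LETTERS
                             if not w or x != inv(w[-1])]))
    return cyclic_reduce(w)

print(is_strictly_minimal("abAB"))                        # a commutator
print(is_strictly_minimal(random_cyclically_reduced(5000)))
```

The commutator is fixed in length by $b \mapsto ba$, so it is not strictly minimal, whereas a long random word is expected to be, in line with the genericity results used in this section.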

An immediate consequence of the Peak Reduction Lemma is: 

Proposition 6.7 ([35]). Let $w \in F$ be a cyclically reduced strictly minimal
element. Then $w$ is of minimal length in its $\mathrm{Aut}(F)$-orbit and for any
$\phi \in \mathrm{Aut}(F)$ we have:

\[ \|w\| \le \|\phi(w)\| \le |\phi(w)|. \]

Theorem 6.8. Put $c_0 := 1 + \frac{2k-3}{4k^2-2k}$. There exists an exponentially
$F$-generic set $W \subseteq F$ with the following property.

For any $\phi \in \mathrm{Aut}(F)$ the following conditions are equivalent:

(1) The automorphism $\phi$ is simple.

(2) We have $\lambda(\phi) = 1$.

(3) We have $\lambda(\phi) < 1 + \frac{2k-3}{2k^2-k}$.

(4) For some $w \in W$ we have $\|\phi(w)\| = \|w\|$.

(5) For every $w \in W$ we have $\|\phi(w)\| = \|w\|$.

(6) For some $w \in W$ we have $\|\phi(w)\| \le c_0\|w\|$.

(7) For every $w \in W$ we have $\|\phi(w)\| \le c_0\|w\|$.

Proof. It is obvious that (1) implies (2) and that (2) implies (3).
We will now prove that (3) implies (1).
Let $\phi \in \mathrm{Aut}(F)$.

Let $\epsilon > 0$ be arbitrary. Let $T(\epsilon)$ be the set of all cyclically reduced
words $w$ such that:

• For any $x \in A^{\pm 1}$ we have $\left| f_w(x) - \frac{1}{2k} \right| < \epsilon/2$.

• For any $x, y \in A^{\pm 1}$ with $x \ne y^{-1}$ we have
$\left| f_w(xy) - \frac{1}{2k(2k-1)} \right| < \epsilon/2$.

Then $T(\epsilon)$ is exponentially $\mathcal{CR}$-generic. Moreover, every
$w \in T(\epsilon)$ is strictly minimal [33], provided that
$\epsilon < \frac{2k-3}{k(2k-1)(4k-3)}$.

Suppose now that $\epsilon < \epsilon_0 := \frac{2k-3}{k(2k-1)(4k-3)}$. Choose an
arbitrary element $w \in T(\epsilon)$ and denote $n = \|w\|$.
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By strict minimality of w we have ||io|| < ||0(w)||. Moreover, by Proposition l6.2l 
(Peak Reduction Lemma) there is a decomposition <j) = t v t v -\ . . . ti such that each 
Ti is a Whitehead move and for each i = 1, . . . , p — 1 

\\T i T i - 1 ...T 1 (w)\\ < \\<f>(w)\\ 

with strict inequalities unless ||iu|| = 

Suppose first that ||to|| = ||0(w)||. Then all inequalities above are equalities and 
by strict minimality of w each Ti is either inner or a relabeling automorphism. This 
implies that <j) — ar where a is inner and t is a relabeling automorphism and that 

\{4>) = l. 

Suppose now that \\w\\ < \\cj>(w)\\. Then the preceding argument shows that in 
fact for any z € T(e) we have \\z\\ < \\4>(z)\\ (since otherwise <p would be simple 
and so \\w\\ = \\<j)(w)\\). 

Denote zq — z and Zi — T,-Tj_i . . . t\[z) for < i < p. Thus z p = 4>{z). Since 
||z|| < ||i^)(2)||, there is some i, 1 < i < p such that n is a non-inner Whitehead 
move of the second kind. Let j be the smallest i with this property. Then all Ti 
with i < j are either inner or relabeling automorphisms, \\z\\ — \\z%\\ and Zi S T(e). 
In particular, z^-i € T(e) and Zj—i is strictly minimal. 

Thus

\[ n = \|z\| = \|z_{j-1}\| < \|z_j\| = \|\tau_j(z_{j-1})\| \le \|\phi(z)\|. \]

Let $(S, a)$ be the characteristic pair of $\tau_j$ and let $S' = A^{\pm 1} - S$. Since $\tau_j$ is
non-inner, we have both $|S| \ge 2$ and $|S'| \ge 2$. Hence $|S|\,|S'| \ge 2(2k-2)$ and there
are at least $2(2k-2)$ edges between $S$ and $S'$ in the weighted Whitehead graph of
$z_{j-1}$. Recall that $a.A^{\pm 1}$ is the total number of occurrences of $a^{\pm 1}$ in $z_{j-1}$.

By Lemma 5.3 we have $\|\tau_j(z_{j-1})\| - \|z_{j-1}\| = S.S' - a.A^{\pm 1}$.

By assumption on $z_{j-1}$ we have $a.A^{\pm 1} \le n\left(\frac{1}{k} + \epsilon\right)$ and so

\[ \|\tau_j(z_{j-1})\| - \|z_{j-1}\| = S.S' - a.A^{\pm 1} \ge 2n(2k-2)\left(\frac{1}{k(2k-1)} - \epsilon\right) - n\left(\frac{1}{k} + \epsilon\right). \]

Hence

\[ \|\phi(z)\| \ge \|z_j\| \ge n + 2n(2k-2)\left(\frac{1}{k(2k-1)} - \epsilon\right) - n\left(\frac{1}{k} + \epsilon\right) \]

and so, since $n = \|z\|$,

\[ \frac{\|\phi(z)\|}{\|z\|} \ge 1 + (4k-4)\left(\frac{1}{k(2k-1)} - \epsilon\right) - \left(\frac{1}{k} + \epsilon\right). \]

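Note that the above inequality holds for any element $z \in T(\epsilon)$.
Since $T(\epsilon)$ is exponentially $\mathcal{CR}$-generic, this implies by Lemma 5.9 that

\[ \lambda(\phi) \ge 1 + (4k-4)\left(\frac{1}{k(2k-1)} - \epsilon\right) - \left(\frac{1}{k} + \epsilon\right). \]

Since $0 < \epsilon < \epsilon_0$ was arbitrary, it follows that

\[ \lambda(\phi) \ge 1 + \frac{4k-4}{k(2k-1)} - \frac{1}{k} = 1 + \frac{2k-3}{k(2k-1)}. \]

This proves that (3) implies (1), so that (1), (2) and (3) are equivalent.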
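
The arithmetic behind this limit can be verified with exact rational numbers. The sketch below is an illustration added for the reader and is not part of the paper; the function names `lower_bound`, `gap` and `eps0` are hypothetical. It checks that the bound $1 + (4k-4)\left(\frac{1}{k(2k-1)} - \epsilon\right) - \left(\frac{1}{k} + \epsilon\right)$ equals $1 + \frac{2k-3}{k(2k-1)} - (4k-3)\epsilon$, so that it exceeds 1 exactly when $\epsilon < \frac{2k-3}{k(2k-1)(4k-3)}$.

```python
from fractions import Fraction


def lower_bound(k, eps):
    """The bound 1 + (4k-4)(1/(k(2k-1)) - eps) - (1/k + eps) from the proof."""
    k, eps = Fraction(k), Fraction(eps)
    return 1 + (4 * k - 4) * (1 / (k * (2 * k - 1)) - eps) - (1 / k + eps)


def gap(k):
    """The limiting value 1 + (2k-3)/(k(2k-1)) obtained as eps tends to 0."""
    return 1 + Fraction(2 * k - 3, k * (2 * k - 1))


def eps0(k):
    """The threshold (2k-3)/(k(2k-1)(4k-3)) at which the bound drops to 1."""
    return Fraction(2 * k - 3, k * (2 * k - 1) * (4 * k - 3))


# The bound is affine in eps with slope -(4k-3) and value gap(k) at eps = 0.
for k in range(2, 8):
    eps = Fraction(1, 1000)
    assert lower_bound(k, eps) == gap(k) - (4 * k - 3) * eps
    assert lower_bound(k, eps0(k)) == 1
```

For $k = 2$ the limiting value is $7/6$ and the threshold is $1/30$.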
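Choose $0 < \epsilon_1 < \epsilon_0$ such that

\[ 1 + (4k-4)\left(\frac{1}{k(2k-1)} - \epsilon_1\right) - \left(\frac{1}{k} + \epsilon_1\right) > c_0, \]

where $c_0$ is the constant from the statement of the theorem.

Put $W = T(\epsilon_1)$. The above argument shows that if for some $w \in W$ we have

\[ \frac{\|\phi(w)\|}{\|w\|} < 1 + (4k-4)\left(\frac{1}{k(2k-1)} - \epsilon_1\right) - \left(\frac{1}{k} + \epsilon_1\right) \]

then $\phi$ is simple.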

With this $W$ we have proved that (6) implies (1). It is obvious that (1) implies
(4)-(7) and that each of (4), (5), (7) implies (6). Thus statements (1), (4), (5), (6),
(7) are equivalent.

We already know that (1), (2) and (3) are equivalent. This completes the proof 
of the theorem. □ 

The following statement, together with Theorem 6.8, implies Theorem F from
the Introduction.

Corollary 6.9. Let $F = F(a_1, \dots, a_k)$, where $k \ge 2$, and let $d$ be the word metric on
$F$ corresponding to the generating set $A = \{a_1, \dots, a_k\}$. Let $\phi \in \mathrm{Aut}(F)$. Then the
following conditions are equivalent:

(1) The automorphism $\phi$ is simple.

(2) The map $\phi : (F, d) \to (F, d)$ is a rough isometry.

(3) The map $\phi : (F, d) \to (F, d)$ is a rough similarity.

Proof. It is obvious that (1) implies (2) and that (2) implies (3).

We will now show that (3) implies (1). Suppose that $\phi$ is a rough similarity, so
that there exist $\lambda > 0$ and $D > 0$ such that for any $w \in F$

\[ \lambda|w| - D \le |\phi(w)| \le \lambda|w| + D. \]

Then obviously $\lambda = \lambda(\phi)$. By Theorem 6.8 either $\phi$ is simple or $\lambda(\phi) > 1$.
Assume the latter. Fix $\lambda_0$ with $1 < \lambda_0 < \lambda$ (for example $\lambda_0 = (1+\lambda)/2$).

Consider the ball $B_n$ of radius $n$ in $F$, where $n \gg 1$. For any $w \in F$ with
$|w| \ge n/\lambda_0$ we have

\[ |\phi(w)| \ge \lambda|w| - D \ge \lambda n/\lambda_0 - D > n, \]

so that $\phi(w) \notin B_n$.

Thus only the elements of length $< n/\lambda_0$ may be potentially taken to $B_n$ by $\phi$.
The number of such elements is smaller than $\#B_n$ since $\lambda_0 > 1$ and $n/\lambda_0 < n$. This
contradicts the fact that $\phi : (F, d) \to (F, d)$ is a bijection. Therefore $\phi$ is simple, as
required. □

The following is Theorem H from the Introduction:

Theorem 6.10. Let $F = F(a_1, \dots, a_k)$ where $k \ge 2$. Let $\phi : F \to \mathrm{Aut}(X)$ be a free
minimal action of $F$ on a simplicial tree $X$ without inversions.
Then exactly one of the following occurs:

(1) There is a simple automorphism $\alpha$ of $F$ such that $X$ is $\phi \circ \alpha$-equivariantly
isomorphic to the Cayley graph of $F$ with respect to $\{a_1, \dots, a_k\}$. In this
case $\lambda(\phi) = 1$.

(2) We have $\lambda(\phi) \ge 1 + \frac{1}{k(2k-1)}$.

Proof. Let $\Gamma = X/F$, let $T$ be a maximal tree in $\Gamma$ and let $B = \{b_1, \dots, b_k\}$
be the geometric basis of $F$ corresponding to $T$. Let $\alpha \in \mathrm{Aut}(F)$ be defined by
$\alpha(b_i) = a_i$ for $i = 1, \dots, k$.

Note that because of Lemma 4.2 for any cyclic word $w$ over $B$ we have $\|w\|_X \ge \|w\|_B$.

Suppose first that $\alpha$ is not a simple automorphism. Then $\lambda(\alpha) \ge 1 + \frac{2k-3}{k(2k-1)}$.

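Hence for every $\epsilon > 0$ there exists an exponentially $\mathcal{CR}$-generic set $R(\epsilon) \subseteq \mathcal{CR}$
such that for any $w \in R(\epsilon)$

\[ \frac{\|w\|_B}{\|w\|_A} = \frac{\|\alpha(w)\|_A}{\|w\|_A} \ge 1 + \frac{2k-3}{k(2k-1)} - \epsilon. \]

Since $\|w\|_X \ge \|w\|_B$ it follows that

\[ \frac{\|w\|_X}{\|w\|_A} \ge 1 + \frac{2k-3}{k(2k-1)} - \epsilon. \]

Since $\epsilon > 0$ was arbitrary, it follows by Lemma 5.9 that

\[ \lambda(\phi) \ge 1 + \frac{2k-3}{k(2k-1)} \ge 1 + \frac{1}{k(2k-1)}, \]

as required.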

Suppose now that $\alpha$ is a simple automorphism. We will assume that $\alpha = \mathrm{Id}$,
and it will be easily seen that the general case is similar.

If $\Gamma$ is a wedge of $k$ loop-edges then the statement of the theorem holds. Suppose
$\Gamma$ is not of this form. Then there exist edges $e, e' \in E(\Gamma - T)$, $e' \ne e^{-1}$, such that
$[t(e), o(e')]_T$ has length at least 1. Let $z$ be the freely reduced word of length 2 in
$B$ corresponding to the sequence $ee'$. Let $\epsilon > 0$ be arbitrary. Let $C(2, \epsilon/2) \subseteq \mathcal{CR}$
be defined as in Proposition 5.5. Thus $C(2, \epsilon/2)$ consists of all cyclically reduced
words $w'$ such that for the cyclic word $w = (w')$ and for every freely reduced word
$xy$ in $A^{\pm 1}$

\[ \left| f_w(xy) - \frac{1}{2k(2k-1)} \right| < \epsilon/2. \]

Then $C(2, \epsilon/2)$ is exponentially $\mathcal{CR}$-generic. Let $w' \in C(2, \epsilon/2)$ be arbitrary and
let $w = (w')$. Note that $\|w'\|_A = \|w'\|_B = \|w\|_A = \|w\|_B$ and $\|w\|_X = \|w'\|_X$.
Then

\[ \|w\|_X \ge \|w\|_B + n_w(z) + n_w(z^{-1}) = \|w\|_A + n_w(z) + n_w(z^{-1}) \]

and so

\[ \frac{\|w'\|_X}{\|w'\|_A} = \frac{\|w\|_X}{\|w\|_A} \ge 1 + f_w(z) + f_w(z^{-1}) \ge 1 + \frac{1}{k(2k-1)} - \epsilon. \]

Since $\epsilon > 0$ was arbitrary, Lemma 5.9 implies that $\lambda(\phi) \ge 1 + \frac{1}{k(2k-1)}$, as required.

□

7. Application to the geometry of automorphisms 

Definition 7.1. Let $F = F(a_1, \dots, a_k)$. An automorphism $\phi$ of $F$ is said to be
$(s, m)$-hyperbolic, where $s > 1$ and $m \ge 1$ is an integer, if for every nontrivial cyclic
word $w$ we have

\[ s\|w\| \le \max\{\|\phi^m(w)\|, \|\phi^{-m}(w)\|\}. \]

An automorphism is hyperbolic if it is $(s, m)$-hyperbolic for some $s > 1$, $m \ge 1$.

The following lemma is an easy consequence of the above definition:

Lemma 7.2. Let $\phi \in \mathrm{Aut}(F)$ be $(s, m)$-hyperbolic and let $w$ be a cyclic word of
minimal length in its $\langle\phi\rangle$-orbit. Then for any $n \ge 2$ we have

\[ \|\phi^{nm}(w)\| \ge s^{n-1}\|w\|. \]

Proof. By definition of hyperbolicity of $\phi$ we have

(!) $\|\phi^{-m}(u)\| \le \|u\| \implies s\|u\| \le \|\phi^m(u)\|$.

Note that by the choice of $w$ we have $\|w\| \le \|\phi^m(w)\|$. Hence applying (!) with
$u = \phi^m(w)$ we get $s\|\phi^m(w)\| \le \|\phi^{2m}(w)\|$. Then, using (!), by induction on $n$ we
see that for any $n \ge 1$

\[ \|\phi^{m(n+1)}(w)\| = \|\phi^{mn+m}(w)\| \ge s\|\phi^{mn}(w)\|. \]

This in turn implies that for any $n \ge 1$

\[ \|\phi^{m(n+1)}(w)\| \ge s^n\|\phi^m(w)\| \ge s^n\|w\|. \]

This proves Lemma 7.2. □
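
The cyclic lengths $\|\phi^n(w)\|$ appearing in Lemma 7.2 are easy to compute for a concrete automorphism. The sketch below is an illustration only and is not from the paper. It assumes lowercase letters for generators and uppercase letters for their inverses; the helper names and the sample map `phi` ($a \mapsto ab$, $b \mapsto a$ on $F(a, b)$) are hypothetical choices made for the example, and no claim is made that this particular map is hyperbolic.

```python
def inv(x):
    """Inverse letter (assumed encoding: 'a' is a generator, 'A' its inverse)."""
    return x.swapcase()


def free_reduce(w):
    """Cancel adjacent pairs x x^{-1} until the word is freely reduced."""
    out = []
    for x in w:
        if out and out[-1] == inv(x):
            out.pop()
        else:
            out.append(x)
    return "".join(out)


def cyclic_reduce(w):
    """Freely reduce w, then strip cancelling first/last letters."""
    w = free_reduce(w)
    while len(w) >= 2 and w[0] == inv(w[-1]):
        w = w[1:-1]
    return w


def apply_map(images, w):
    """Apply the endomorphism sending each generator g to images[g]."""
    def image(x):
        if x.islower():
            return images[x]
        # Image of an inverse letter: invert and reverse the generator's image.
        return "".join(inv(y) for y in reversed(images[x.lower()]))
    return free_reduce("".join(image(x) for x in w))


def cyclic_length(images, w, power=1):
    """Cyclic length of the image of w under the given power of the map."""
    for _ in range(power):
        w = apply_map(images, w)
    return len(cyclic_reduce(w))


phi = {"a": "ab", "b": "a"}  # hypothetical example map on F(a, b)
lengths = [cyclic_length(phi, "ab", n) for n in range(4)]
```

For this sample map the list `lengths` consists of consecutive Fibonacci numbers, since all images of the positive word `"ab"` are positive words and no cancellation occurs.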

The following is Theorem E from the Introduction:

Theorem 7.3. Let $\phi$ be an $(s, m)$-hyperbolic automorphism of $F$. Then

\[ \liminf_{n \to \infty} \sqrt[n]{\lambda(\phi^n)} \ge s^{1/m} > 1. \]

Proof. Let $t \ge 2$ be an arbitrary integer. Let $w \in SM$ be a strictly minimal
element. Since $w$ is minimal in its $\mathrm{Aut}(F)$-orbit, it is also minimal in its $\langle\phi\rangle$-orbit.
Therefore by Lemma 7.2

\[ \|\phi^{tm}(w)\| \ge s^{t-1}\|w\| \quad \text{and} \quad \frac{\|\phi^{tm}(w)\|}{\|w\|} \ge s^{t-1}. \]

Since $SM$ is exponentially $\mathcal{CR}$-generic, Lemma 5.9 implies that $\lambda(\phi^{tm}) \ge s^{t-1}$.
Moreover, there is $D > 0$ such that for any cyclically reduced word $u$ we have

\[ \|\phi^i(u)\| \ge D\|u\| \quad \text{for all } 0 \le i < m. \]

Let $n \ge 2m$ be an integer. Then we can write $n$ as $n = mt + i$ where $t \ge 2$ and
$0 \le i < m$. For any $w \in SM$ we have

\[ \|\phi^n(w)\| = \|\phi^{mt+i}(w)\| \ge D\|\phi^{mt}(w)\| \ge D s^{t-1}\|w\| \]

and hence

\[ \frac{\|\phi^n(w)\|}{\|w\|} \ge D s^{t-1}. \]

Since $SM$ is exponentially $\mathcal{CR}$-generic, Lemma 5.9 again implies that for any $n \ge 2m$

\[ \lambda(\phi^n) \ge D s^{t-1} = \frac{D}{s}\, s^{\lfloor n/m \rfloor}. \]

This implies

\[ \liminf_{n \to \infty} \sqrt[n]{\lambda(\phi^n)} \ge s^{1/m} > 1, \]

as claimed. □
We can now prove Corollary I from the Introduction:

Corollary 7.4. Let $F = F(a_1, \dots, a_k)$ be a free group of rank $k \ge 2$, equipped with
the standard metric.

Then for any $\phi \in \mathrm{Aut}(F)$ we have:

\[ \mathrm{flux}(\phi) = \begin{cases} 0, & \text{if } \phi \text{ is a relabeling automorphism,} \\ 1, & \text{otherwise.} \end{cases} \]



Proof. Let $\lambda = \lambda(\phi)$ be the generic stretching factor.
Suppose first that $\lambda > 1$. Then the set

\[ T = \left\{ w \in F : \frac{|\phi(w)|}{|w|} \ge \frac{\lambda+1}{2} \right\} \]

is exponentially $F$-generic.

Let $B(n)$ be the ball of radius $n$ in $F$ and let $w \in B(n) \cap T$ be such that
$2n/(\lambda+1) < |w| \le n$. Then

\[ |\phi(w)| \ge |w|\,\frac{\lambda+1}{2} > \frac{2n}{\lambda+1} \cdot \frac{\lambda+1}{2} = n. \]

Hence for each $w \in [B(n) \cap T] - B(2n/(\lambda+1))$ we have $|\phi(w)| > n$. The size of
$B(2n/(\lambda+1))$ is exponentially smaller than that of $B(n)$ since $2/(\lambda+1) < 1$. Hence
by exponential genericity of $T$

\[ \frac{\#[B(n) \cap T] - \#B(2n/(\lambda+1))}{\#B(n)} \to 1 \quad \text{exponentially fast.} \]

Hence 

lim = 1 

and therefore $\mathrm{flux}(\phi) = 1$.

Suppose now that $\lambda(\phi) = 1$. By Theorem 6.8 this implies that $\phi = \alpha\tau$ where $\alpha$
is inner and $\tau$ is a relabeling automorphism.

If $\alpha = 1$, then obviously $\mathrm{flux}(\phi) = 0$. Suppose now that $\alpha$ is nontrivial. Since $\tau$
acts as a permutation on each ball and each sphere in $F$, we can assume that $\tau = 1$
and $\phi = \alpha$. Thus there is $u \in F$, $u \ne 1$, such that for every $w \in F$ we have $\phi(w) = uwu^{-1}$.
There are at least $c_1(2k-1)^n$ elements $w$ with $|w| = n$ such that the product $uwu^{-1}$
is freely reduced as written, where $c_1 > 0$ is a constant independent of $n$ and $u$.
For each such element we have $|\phi(w)| > |w|$. Hence there is a constant $c_2 \in (0, 1)$
independent of $n$ and $u$ such that for any $n > 0$

!>^)> C2>0 . 

Hence 



l>flux{4>)= lim f feffi > lim ^=1. 
Thus $\mathrm{flux}(\phi) = 1$ and the proof is complete.

□ 






8. Random elements in regular languages 

The most reasonable way of choosing a "random" element in the regular language
$L$ is via a random walk in the transition graph of an automaton $M$ accepting $L$. It
turns out that the natural model of computation here is that of a nondeterministic
finite automaton or NDFA. Such an automaton $M$ over an alphabet $A$ with state
set $Q$ is specified by a finite directed graph $\Gamma(M)$. The vertex set of $\Gamma(M)$ is the
set $Q$ of states of $M$ and $Q$ comes equipped with a distinguished nonempty subset
$I$ of initial or start states. The directed edges of $\Gamma(M)$ are labelled by elements of
$A$ and these edges are treated as transitions of $M$. If $q \in Q$ is a state and $a \in A$ is
a letter, we allow multiple edges labelled $a$ with origin $q$ and we also allow the case
when there are no such edges. Nondeterministic automata are thus by their nature
"partial". There is a distinguished subset of $Q$ of accepting states. A word $w$ over
$A$ is said to be accepted by $M$ if there exists a directed path with label $w$ in $\Gamma(M)$
from some initial state to an accepting state. The language, $L(M)$, accepted by $M$
is the collection of all words accepted by $M$.

We will also use the directed graph $\Gamma_1(M)$ defined as follows. The vertex set of
$\Gamma_1(M)$ is the set of directed edges $E(\Gamma(M))$ of $\Gamma(M)$. If $e_1, e_2 \in E(\Gamma(M))$, the pair
$(e_1, e_2)$ defines a directed edge from $e_1$ to $e_2$ in $\Gamma_1(M)$ if the terminus of $e_1$ is the
origin of $e_2$, that is, if $e_1, e_2$ is a directed edge-path in $\Gamma(M)$.

Definition 8.1 (Normal Automaton). Let $A$ be a finite alphabet. A normal automaton
over $A$ is a nondeterministic finite state automaton $M$
over $A$ such that the following conditions hold:

• the automaton $M$ has a nonempty set of accept states;

• the directed graph $\Gamma(M)$ has at least one edge;

• the directed graph $\Gamma(M)$ is strongly connected, that is, for any two states
$q, q'$ of $M$ there exists a directed edge-path from $q$ to $q'$ in $\Gamma(M)$.

The third condition in the above definition is the most important one as it is
responsible for the irreducibility of a Markov chain naturally associated to a normal
automaton:

Definition 8.2 (Associated Markov chain). Let $M$ be a normal automaton over a
finite alphabet $A$. We define an associated finite state Markov chain $M'$ as follows.
The set of states of $M'$ is the set $E$ of directed edges of $\Gamma(M)$. If the origin of $f$
is not the terminus of $e$ we put the transition probability $p_{e,f} = 0$. If the origin of
$f$ is equal to the terminus of $e$ we put $p_{e,f} = 1/m$, where $m$ is the total number of
outgoing directed edges from the terminus of $e$.
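
The construction of $M'$ can be carried out directly from a list of labelled edges. The following Python sketch is an illustration only and is not from the paper. It assumes that an automaton is given as a list of triples (origin, label, terminus); the function names and the three-edge sample automaton are hypothetical. The stationary distribution is approximated by iterating the averaged map $\mu \mapsto (\mu + \mu P)/2$, which is a choice made here so that the iteration also converges for periodic chains; it has the same stationary distribution as $P$.

```python
from fractions import Fraction


def associated_chain(edges):
    """Transition matrix p[e][f] of the chain M' on the edges of the graph.

    edges: list of (origin, label, terminus) triples of a strongly
    connected labelled directed graph.
    """
    out_degree = {}
    for origin, _, _ in edges:
        out_degree[origin] = out_degree.get(origin, 0) + 1
    n = len(edges)
    p = [[Fraction(0)] * n for _ in range(n)]
    for i, (_, _, terminus) in enumerate(edges):
        for j, (origin, _, _) in enumerate(edges):
            if origin == terminus:
                # Uniform choice among edges leaving the terminus of e.
                p[i][j] = Fraction(1, out_degree[terminus])
    return p


def stationary(p, steps=2000):
    """Approximate stationary distribution via the averaged (lazy) iteration."""
    n = len(p)
    mu = [1.0 / n] * n
    for _ in range(steps):
        pushed = [sum(mu[i] * float(p[i][j]) for i in range(n)) for j in range(n)]
        mu = [(a + b) / 2 for a, b in zip(mu, pushed)]
    return mu


# Hypothetical two-state automaton: 0 -a-> 0, 0 -b-> 1, 1 -a-> 0.
edges = [(0, "a", 0), (0, "b", 1), (1, "a", 0)]
p = associated_chain(edges)
mu0 = stationary(p)
```

In this small example each row of `p` sums to 1 and the stationary distribution is uniform on the three edges.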

Convention 8.3. Note that the sample space $\Omega$ for the Markov chain $M'$ defined
above consists of all semi-infinite directed edge-paths

\[ \omega = e_1, e_2, \dots, e_n, \dots \]

in the graph $\Gamma(M)$. Every such path has a label

\[ w(\omega) = a_1 a_2 \dots a_n \dots \]

that is a semi-infinite word over the alphabet $A$. We will denote $w_n = w_n(\omega) :=
a_1 \dots a_n$, the initial segment of length $n$ of $w$. The set $\Omega$ comes equipped with the
natural topology, where we think of $\Omega$ as the union of boundaries of rooted trees
$(T_e)_{e \in E}$. The vertices of $T_e$ are finite edge-paths in $\Gamma(M)$ beginning with $e$. The
Borel $\sigma$-algebra on $\Omega$ is generated by the following open-closed cylinder sets $Cyl(\gamma)$,
where $\gamma$ is a nonempty finite edge-path in $\Gamma(M)$:

\[ Cyl(\gamma) := \{\omega \in \Omega : \gamma \text{ is the initial segment of } \omega\}. \]

If we put an initial probability distribution $\mu$ on $E$, this defines a Borel probability
measure $P_\mu$ on $\Omega$. This measure is defined on the cylinder sets by the standard
convolution formula. If $\gamma = e_1, \dots, e_n$, where $n > 1$, then

\[ P_\mu(Cyl(\gamma)) := \mu(e_1)\, p_{e_1,e_2}\, p_{e_2,e_3} \cdots p_{e_{n-1},e_n}. \]

If $n = 1$ then $P_\mu(Cyl(e)) := \mu(e)$.
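
The convolution formula for cylinder sets translates into a short product over consecutive edges. The sketch below is an illustration only and is not from the paper. It assumes that edges are indexed $0, \dots, |E|-1$, that `mu` is a list of initial probabilities and that `p` is a transition matrix indexed in the same way; the function name `cylinder_probability` and the sample data are hypothetical.

```python
from fractions import Fraction


def cylinder_probability(mu, p, path):
    """P_mu(Cyl(path)) = mu(e_1) * p[e_1][e_2] * ... * p[e_{n-1}][e_n].

    path is a nonempty list of edge indices.
    """
    prob = Fraction(mu[path[0]])
    for e, f in zip(path, path[1:]):
        prob *= p[e][f]
    return prob


# Hypothetical three-edge chain: edge 0 and edge 2 end where edges 0 and 1
# begin, and edge 1 ends where edge 2 begins.
half, zero, one = Fraction(1, 2), Fraction(0), Fraction(1)
p = [[half, half, zero], [zero, zero, one], [half, half, zero]]
mu = [Fraction(1, 3)] * 3
```

A path that is not a directed edge-path in the graph, such as `[1, 0]` in the sample data, receives probability 0, as it should.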

Lemma 8.4. Let $M$ be a normal automaton. Then the associated finite state
Markov chain $M'$ is irreducible. In particular, there is a unique stationary initial
probability distribution $\mu_0$ on the set of states $E$ of $M'$. This distribution has the
property $\mu_0(e) > 0$ for each $e \in E$.

Proof. To show that $M'$ is irreducible we have to prove that for any two edges
$e, f \in E$ there is $n > 0$ such that the $n$-step transition probability $p^{(n)}_{e,f} > 0$. Since
$\Gamma(M)$ is strongly connected, there exists a directed edge-path $\gamma$ in $\Gamma(M)$ from the
terminus of $e$ to the origin of $f$. Then $e\gamma f$ is a directed edge-path in $\Gamma(M)$ that
starts with $e$ and ends with $f$. Hence $\Gamma_1(M)$ is strongly connected and therefore
$M'$ is irreducible.

The irreducibility of $M'$ implies the existence and uniqueness of a positive stationary
distribution $\mu_0$ on $E$, as required. □

If we fix an initial probability distribution $\mu$ on $E$, this defines a probability
measure $P_\mu$ on $\Omega$.

Lemma 8.5. Let $M$ be a normal automaton. Let $M'$ be the associated finite state
Markov chain and let $\mu_0$ be the stationary initial distribution for $M'$. Let $Z \subseteq \Omega$
be a set such that $P_{\mu_0}(Z) = 0$. Then for any other initial distribution $\mu$ on $E$ we
have $P_\mu(Z) = 0$.

Proof. Let $\mu$ and $\mu_0$ be as above. Put

\[ c := \max\left\{ \frac{\mu(e)}{\mu_0(e)} : e \in E \right\}. \]

Note that $0 < c < \infty$ since $\mu_0(e) > 0$ for each $e \in E$. Consider an arbitrary cylinder
set $Cyl(\gamma) \subseteq \Omega$, where $\gamma = e_1, e_2, \dots, e_n$. From the definitions of $P_\mu$ and $P_{\mu_0}$ we see
that

\[ P_\mu(Cyl(\gamma)) = \frac{\mu(e_1)}{\mu_0(e_1)}\, P_{\mu_0}(Cyl(\gamma)) \le c\, P_{\mu_0}(Cyl(\gamma)). \]

Hence for an arbitrary Borel set $Z \subseteq \Omega$ we have $P_\mu(Z) \le c\, P_{\mu_0}(Z)$. In particular,
if $P_{\mu_0}(Z) = 0$ then $P_\mu(Z) = 0$. □

The previous two lemmas depend only on the automaton $M$ being normal. Suppose
now that $L = L(M)$. For each state $q$ choose a shortest path from $q$ to an
accept state and let $u_q$ be the word in $A^*$ labelling that path. This is possible since
$\Gamma(M)$ is strongly connected and the set of accept states is nonempty by the assumption
on $M$. Note that $u_q$ is the empty word if and only if $q$ is an accept state. The
lengths of the words $u_q$ are bounded above by some constant depending on $M$. For a finite
walk $w_n$ denote $w'_n = w_n u_q$ where $q$ is the state in which $w_n$ ends. Note that if $w_n$
begins in a state from $I$ then $w'_n \in L$. Thus if $\mu$ is a distribution supported on the
set of edges in $E(M)$ with initial vertices from $I$ and $w_n$ is obtained by performing
$n$ steps of the chain $M'$ with initial distribution $\mu$, then $w'_n \in L$ can be thought of
as a "random" element of $L$.

We can now prove (a slight generalization of) Theorem J from the Introduction:

Theorem 8.6. Let $M$ be a normal automaton over the alphabet $A$ and let $L =
L(M)$ be the language accepted by $M$.

Let $\phi : A^* \to G$ be a monoid homomorphism, where $G$ is a group with a left-invariant
semi-metric $d_G$. Then there exists a number $\lambda = \lambda(M, \phi, d_G) \ge 0$ such
that for any initial distribution $\mu$ on $E(M)$ we have

\[ \lim_{n \to \infty} \frac{|\phi(w_n)|_G}{n} = \lim_{n \to \infty} \frac{|\phi(w'_n)|_G}{n} = \lambda \]

almost surely and in $L^1$ with respect to $P_\mu$.

Proof. Let $\mu_0$ be the unique stationary initial distribution for $M'$. As before denote
by $S : \Omega \to \Omega$ the shift operator which erases the first edge of every $\omega = e_1, e_2, \dots \in
\Omega$. Stationarity of $\mu_0$ means that $S : (\Omega, P_{\mu_0}) \to (\Omega, P_{\mu_0})$ is a measure-preserving
map. Since $M'$ is irreducible and aperiodic, $S$ is also ergodic.
As before, define $X_n : \Omega \to \mathbb{R}$ as

\[ X_n(\omega) := |\phi(w_n(\omega))|_G. \]

Then again it is easy to see that $X_n \ge 0$ and $X_{n+m}(\omega) \le X_n(\omega) + X_m(S^n\omega)$. Hence
by the Subadditive Ergodic Theorem there is $\lambda \ge 0$ and there is a subset $Z \subseteq \Omega$
with $P_{\mu_0}(Z) = 0$ such that for any $\omega \notin Z$

\[ \lim_{n \to \infty} \frac{|\phi(w_n(\omega))|_G}{n} = \lambda. \]

Let $\mu$ be an arbitrary initial distribution on $E$. Then by Lemma 8.5 we have
$P_\mu(Z) = 0$. Thus

\[ \frac{|\phi(w_n)|_G}{n} \to \lambda \quad \text{almost surely with respect to } P_\mu. \]

Note that by the left-invariance of $d_G$ we have $|\phi(w)|_G \le K|w|$ where $K =
\max\{|\phi(a)|_G : a \in A\}$. Hence $X_n/n = |\phi(w_n)|_G/n \le K$ and by the Dominated
Convergence Theorem almost sure convergence of $X_n/n$ implies $L^1$-convergence.

Since $d_G$ is a seminorm on $G$ and the length of any path $w'_n$ differs from $|w_n|$ by
at most a fixed constant, it is also true that $|\phi(w'_n)|_G$ differs from $|\phi(w_n)|_G$ by at
most a fixed constant and thus it is also the case that

\[ \lim_{n \to \infty} \frac{|\phi(w'_n)|_G}{n} = \lim_{n \to \infty} \frac{|\phi(w_n)|_G}{n} = \lambda. \]

□
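
The limit in Theorem 8.6 can be explored numerically by simulating the chain $M'$. The sketch below is an illustration only and is not from the paper. It assumes that $G$ is a free group whose generators are written as lowercase letters with uppercase inverses, that $\phi$ sends each label to itself, and that $|\cdot|_G$ is the freely reduced length; the function names, the one-state sample automaton and the fixed random seed are hypothetical choices.

```python
import random


def inv(x):
    """Inverse letter (assumed encoding: 'a' is a generator, 'A' its inverse)."""
    return x.swapcase()


def random_walk_labels(edges, start_edges, n, rng):
    """Labels read along n steps of the chain M' on the edges of the graph.

    The first edge is chosen uniformly from start_edges; each later edge is
    chosen uniformly among the edges leaving the current terminus.
    """
    by_origin = {}
    for edge in edges:
        by_origin.setdefault(edge[0], []).append(edge)
    edge = rng.choice(start_edges)
    labels = [edge[1]]
    for _ in range(n - 1):
        edge = rng.choice(by_origin[edge[2]])
        labels.append(edge[1])
    return labels


def reduced_length(labels):
    """Length of the freely reduced form of a word in a free group."""
    stack = []
    for x in labels:
        if stack and stack[-1] == inv(x):
            stack.pop()
        else:
            stack.append(x)
    return len(stack)


# Hypothetical one-state automaton with loops labelled a, b, A, B.
edges = [(0, x, 0) for x in "abAB"]
n = 20000
labels = random_walk_labels(edges, edges, n, random.Random(0))
estimate = reduced_length(labels) / n
```

For this one-state automaton the chain is the simple random walk on $F(a, b)$, whose rate of escape is $1/2$, so `estimate` should be close to that value for large `n`.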

There is substantial flexibility in the choice of the Markov chain $M'$. The proof of
Theorem 8.6 goes through without change for any choice of transition probabilities
in $M'$ such that $p_{e,f} > 0$ whenever $(e, f)$ is an edge of $\Gamma_1(M)$ and $p_{e,f} = 0$ whenever
$(e, f)$ is not an edge of $\Gamma_1(M)$.

9. Open Problems

Problem 9.1. Let $\phi$ be an arbitrary (not necessarily injective) endomorphism of
$F = F(a_1, \dots, a_k)$. Is $\lambda(\phi)$ rational? Computable?

Problem 9.2. Let $\phi \in \mathrm{Aut}(F)$. What can be said about the behavior of $\lambda(\phi^n)$ as
$n \to \infty$? Same for $\sqrt[n]{\lambda(\phi^n)}$. How are these quantities connected with growth rates
of different (or perhaps just top) strata from relative train-track representatives of
$\phi$?

It is clear that the asymptotics of $\lambda(\phi^n)$ should reflect the dynamical properties
of $\phi$. For example, it is not hard to see that for any Nielsen automorphism $\tau$
the stretching factor $\lambda(\tau^n)$ grows at most linearly and $\limsup_{n \to \infty} \sqrt[n]{\lambda(\tau^n)} = 1$.
On the other hand for hyperbolic automorphisms $\phi$ Theorem 7.3 implies that
$\liminf_{n \to \infty} \sqrt[n]{\lambda(\phi^n)} > 1$, so that the sequence $(\lambda(\phi^n))_n$ grows exponentially.

Problem 9.3. Can one estimate (say in the sense of Large Deviations) the speed
of convergence $|\phi(w_n)|_G / n \to \lambda(\phi)$?

We have seen that in the case of free group automorphisms, for any $\epsilon > 0$, the
probability that $|\phi(w_n)|/n$ differs from $\lambda(\phi)$ by more than $\epsilon$ tends to $0$
exponentially fast as $n \to \infty$. Are there any other situations
where the speed of convergence in Theorem A can be estimated?

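Problem 9.4. Let $F = F(a_1, \dots, a_k)$ where $k \ge 2$. Consider the set

\[ W = \{\lambda(\phi) : \phi : F \to \mathrm{Aut}(X) \text{ is a free simplicial action of } F \text{ on some simplicial tree } X\}. \]

We know that $W \subseteq \mathbb{Q}$ and, moreover, $2kW \subseteq \mathbb{Z}\left[\frac{1}{2k-1}\right]$.
Is $W$ a discrete subset of $\mathbb{Q}$?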

Problem 9.5. The notion of a generic stretching factor for $\phi \in \mathrm{Aut}(F)$ depends
on the choice of a free basis $b = (a_1, \dots, a_k)$ of $F$, and, more generally, on the choice
of a finite generating set $S$ of $F$ and the corresponding word metric $d_S$. Denote by
$\lambda_S(\phi)$ the generic stretching factor of $\phi$ considered as a map $(F, d_S) \to (F, d_S)$.
One can define the following uniform constants

\[ \lambda'(\phi) := \inf\{\lambda_b(\phi) : b \text{ is a free basis of } F\} \]

and

\[ \lambda''(\phi) := \inf\{\lambda_S(\phi) : S \text{ is a finite generating set of } F\}. \]

(Note that $\lambda''(\phi)$ can be defined in the same fashion for an automorphism $\phi$ of an
arbitrary finitely generated group $G$.)

For $\phi \in \mathrm{Aut}(F)$ are the constants $\lambda'(\phi)$ and $\lambda''(\phi)$ actually realized by some free
bases and finite generating sets of $F$ accordingly? That is, are the above infima
actually minima? Are $\lambda'(\phi)$ and $\lambda''(\phi)$ algorithmically computable?

Similarly we can define

\[ \|\phi\|' = \inf\{\|\phi\|_b : b \text{ is a free basis of } F\} \]

and

\[ \|\phi\|'' = \inf\{\|\phi\|_S : S \text{ is a finite generating set of } F\}. \]

Since both of these constants are integers, they are clearly realizable by some $b$ and
$S$ accordingly. Are these constants algorithmically computable?